Gene therapy deserves a fresh chance


Initial interest in gene therapy waned after the technology failed to live up to expectation. Progress made 
since has received little attention, but suggests that the pervading sense of disillusionment is misplaced. 


In the early 1990s, when the first human trials got under way, it
seemed to many that the era of gene therapy was at hand: the tech- 
niques of modern molecular biotechnology would make it pos- 
sible to repair genetic defects by inserting healthy DNA directly 
into a patient’s cells. The excitement was short-lived. Lasting effects 
proved difficult to obtain in early trials, and the community quickly 
grew sceptical. Then, in 2003, when it was announced that several 
gene-therapy patients in a Paris-based clinical trial had devel- 
oped leukaemia, and that one of them had died, the mood became 
bleak. Subsequent reports of successful and effective gene-therapy 
trials have done little to lift the prevailing sense of doom. For most 
researchers, gene therapy now seems like a dead end. 

But it doesn’t have to be a dead end — not if scientists shift their 
perspective on the risks of gene therapy to be more in line with that 
of clinicians. 

Scientists are trained to focus on understanding the systems that 
they study in great detail. And when they devise therapeutic interven- 
tions — for example, harnessing a viral shell to insert a therapeutic 
gene into a patient’s DNA — they naturally want those systems to 
be engineered with equally great care, and for them to be as near to 
risk-free perfection as possible. 

Clinicians, by contrast, care for real patients in real time, which 
makes treatment decisions a matter of pragmatism. How do the 
risks stack up against the benefits for each available alternative 
— given that the risks are never zero? Clinicians are certainly not 
cavalier about their patients’ well-being, but they may well end up 
prescribing a therapy that has a poorly understood mechanism and 
potentially large side effects because it gives the patient the best 
odds of recovery or survival. If they — and patients — had shied 
away from such dangers in the past, life-saving interventions such 
as organ grafts and bone-marrow transplants might never have 
been developed. 


From that perspective, the fact that, collectively, the Paris trial 
and others carried out since have produced positive results in some 
20 patients out of a total of two dozen looms at least as large as the 
handful of leukaemia cases. To clinicians, such results suggest a treat- 
ment that is risky, but potentially life-saving — a new option for peo- 
ple for whom there are no alternatives. 

However, this was not the view that prevailed. When the viral 
delivery vehicle itself turned out to be 
responsible for the leukaemia cases in 
the Paris trial, scientists deemed the 
trial a failure. Bad press ensued, pro- 
posals for gene-therapy clinical tri- 
als came under increased regulatory 
scrutiny and standards for demonstrating safety were set higher 
than for other approaches. Unsurprisingly in such a climate, the 
biotechnology and pharmaceutical industries gradually dropped 
out of the gene-therapy pursuit. This corporate disinterest slowed 
clinical progress: academic centres are ill-equipped to make gene- 
therapy vectors of clinical grade and scale, and research funding is 
typically insufficient to support clinical trials. More insidiously, it 
has become harder to recruit young talent to a field that is perceived 
as falling short of its promises. 

To reverse this trend, it is time for researchers and industry to 
refresh their perspective on gene therapy and to consider its suc- 
cesses with as much intensity as its setbacks. The focus on adverse 
events has had positive consequences: researchers dissected the exact 
molecular mechanisms that led to cancer, designed better vectors, 
devised animal models to test these vectors and developed sophis- 
ticated assays for monitoring patients. As a result, both scientists 
and clinicians now have a battery of extraordinarily refined tools 
for preclinical and clinical studies of gene therapy. The field is ripe 
for further successes.


Darwin and culture 


A new series of essays traces the astounding
variety of reactions to the theory of evolution. 


The public reception of scientific ideas depends largely on two
factors: people’s ability to grasp factual information and the
cultural lens through which that information is filtered. The
former is what scientists tend to focus on when they give popular
accounts of issues such as climate change. The assumption is that
if they explain things very, very clearly, everyone will understand.
Unfortunately, this is an uphill battle. The general public’s average
capacity to weigh facts and numbers is notoriously poor, although
there is encouraging evidence that probabilistic reasoning can be
improved by targeted education early in life (see page 1189).

Even more crucial, however, are the effects of the cultural lens. Over 
the coming month, Nature’s Opinion pages will explore particularly 
vivid examples of these effects in the world’s widely divergent reac- 
tions to Charles Darwin's ideas about evolution in the late nineteenth 
and early twentieth centuries (see page 1200). 

In England, for example, the Church reacted badly to Darwin's 
theory, going so far as to say that to believe it was to imperil your soul. 
But the notion that Darwin's ideas ‘killed’ God and were a threat to 
religion was by no means the universal response in the nineteenth 
century. 

Darwin's theory reached the world at a time when many people were 
looking for explanations for social, political and racial inequalities,
and in many parts of the world were wondering how to improve their
lot in the face of Europe's global imperialism. So from Egypt to India, 
China and Japan, many religious scholars embraced Darwin’s ideas, 
often showing how their own schools of thought had anticipated the 
notion of evolution. Against the threat of Western imperialism and 
Western charges of ‘backwardness’, it was to their advantage to
highlight the rationality of their creed.

In China, Darwin's ideas were seen as supporting Confucians’ belief 
in the perfectibility of the cosmic order. Evolutionary theory also 
became fodder for political movements of revolution and reform, 
and eventually laid the groundwork for communism. Latin American 
politicians initially reacted to Darwin's ideas by attempting to entice 
white Europeans to emigrate and intermarry with local populations, 
believing that this would ‘improve the stock’. But after two world 
wars had made European culture look less impressive, Latin America 
began to see its racial diversity as an advantage, and moved towards a 
social view that favoured a homogeneous blend of cultures. 

In nineteenth-century Russia, meanwhile, a tendency to distrust 
rabid, dog-eat-dog capitalism helped incline naturalists away from
a view of evolution that emphasized competition between species.
Instead they embraced a ‘theory of mutual aid’, an account that
focused on the role of cooperation in ensuring survival in a harsh 
environment. 

The lesson for today’s scientists and policy-makers is simple: they 
cannot assume that a public presented with ‘the facts’ will come to 
the same conclusion as themselves. They must take value systems, 
cultural backdrops and local knowledge gaps into account and frame 
their arguments accordingly. Such approaches will be crucial in facing 
current global challenges, from recessions to pandemics and climate 
change. These issues will be perceived and dealt with differently by 
different nations — not because they misunderstand, but because 
their understanding is in part locally dependent. 

Darwin once said: “But then with me the horrid doubt always arises 
whether the convictions of man’s mind, which has been developed 
from the mind of the lower animals, are of any value or at all trust- 
worthy.” Researchers and policy-makers would do well to mimic 
his humility when presenting science, and remember how people’s 
minds truly work.


Mind the spin 


Scientists — and their institutions — should resist 
the ever-present temptation to hype their results. 


The circumstances surrounding the recent announcement of
results from an HIV vaccine trial in Thailand are troubling.

The sponsors of the US$119-million phase III clinical trial, a
consortium led by the US Army, the National Institutes of Health and
the Thai government, announced on 24 September that the trial had
been a success: an analysis of the data showed that the vaccine had a
statistically significant effect on preventing infection.

Other scientists could not immediately assess that claim, however:
the full data from the trial were not made available until 20 October,
when they were presented at an AIDS vaccine conference in Paris
and in an article published online the same day (S. Rerks-Ngarm
et al. N. Engl. J. Med. doi:10.1056/nejmoa0908492; 2009). The article
contained two other data analyses, not mentioned in the initial
announcement, showing smaller effects that were not statistically
significant (see page 1187).

The trial’s sponsors defend the premature announcement on the
grounds that they had promised to inform the Thai people of the
results first; 24 September is also Mahidol Day, the anniversary of
the death of the king’s father and a day of national observance in
Thailand. The sponsors also argue that announcing the less-upbeat
analyses along with the positive result would have been too
complicated for the public to understand; they wanted to quickly deliver
a clear-cut message on the trial’s findings. Making the full data
immediately available to scientists on 24 September would also have
been impossible, they add, because of the conference and journal
embargoes.

To their credit, the scientists involved did emphasize in their
public statements that any vaccine effect was “modest”, and that
the vaccine itself was of no immediate public-health utility. At the
same time, however, they hammered home the message that this 
was “the first time an HIV vaccine has successfully prevented HIV 
infection in humans”, and implied that the event was somehow
historic. Such statements, together with the selective initial 
presentation of the data, are well outside the scientific norms for 
presenting the results of clinical trials. They inevitably create suspi- 
cion that the trial sponsors may have put an excessively positive spin 
on results that are far from clear-cut, 
in a trial that has long been contro- 
versial (T. V. Padma Nature Med. 10, 
1267; 2004). The trial has also been 
six years in the works, and so there 
seems no particular public-health 
urgency to justify publication by press 
conference. 

Fortunately, such stories are still 
rare in science. Witness the way sci- 
entists have behaved since the begin- 
ning of the current H1N1 flu pandemic, in which the urgent threat 
to health creates legitimate tensions between getting results out fast 
and respecting peer review. Most researchers have negotiated this 
tension well, through a combination of fast-track publication by 
journals and online pre-publication sharing of preliminary data — 
but not through hyping their results. 

Yet the temptation for scientists and their institutions to spin their 
research to the media, or to go publicity-mongering, is always there. 
And — as illustrated by the excessive public-relations campaign sur- 
rounding Ida, a fossil presented as a missing link in human evolution 
(see Nature 459, 484; 2009 and 461, 1040; 2009) — too many in the 
media will buy into the initial hype. 

Such behaviour is corrosive to the process of scholarly scientific 
communication. Research institutions must not allow it to become 
the norm.


Chill out


Am. Nat. doi:10.1086/648065 (2009) 

Sea stars know how to keep their cool when 
the weather heats up. By sucking up cold 
water while submerged at high tide, they can 
avoid overheating after the waves roll out. 
Sylvain Pincebourde, now at the University 
of Francois Rabelais in Tours, France, and 
his colleagues exposed ochre sea stars 
(Pisaster ochraceus) to simulated tidal 
cycles and various water and ambient 
temperatures in laboratory aquaria. They 
found that the intertidal predators increase 
the amount of colder-than-air fluid in their 
internal cavity after exposure to elevated 
aerial temperatures during low tide. This 
substantially reduces the sea stars’ body 
temperatures, as shown in the infrared image, 
during subsequent low tides. 




PHYSICS 
Quantum speed limit 


Phys. Rev. Lett. 103, 160502 (2009) 
The processing speed of computer chips has 
doubled almost every two years for the past 
40, as engineers have crammed ever more 
transistors into smaller circuits. But according 
to Lev Levitin and Tommaso Toffoli of 
Boston University in Massachusetts, chips 
will ultimately hit a roadblock, limited by 
the minimum time it takes for a particle to 
flip from one quantum state to another — a 
fundamental step in any information system. 
There are two independent bounds on 
this minimum time — one based on the 
average energy of the quantum system, the 
other based on the uncertainty in the system's 
energy. In their calculations, Levitin and 
Toffoli unify the bounds and show there is an 
absolute limit to the number of operations 
that can be achieved per second by a computer 
system of a given energy. Levitin says that, at 
the current doubling pace, computing speed 
will reach this limit in about 80 years. 


BIOLOGY 


How cockroaches steer 


J. Exp. Biol. 212, 3473-3477 (2009) 

Many animals can sense Earth’s magnetic 
field, but for some species, it remains 
uncertain whether this ability depends on 
embedded magnetic particles or magnetically 
sensitive ‘radical pair’ chemical reactions of 
light-sensitive molecules. 

Martin Vacha and his colleagues at 
Masaryk University in the Czech Republic 
exposed American cockroaches to a magnetic 
field in which the position of magnetic 
north changed by 60° every 5 minutes. This
normally makes cockroaches restless. The
team also applied a radio-frequency field at
only a fraction of Earth's field intensity to jam
the creatures' magnetic sensing system. At
a certain frequency, the cockroaches stayed
calm. Other radio frequencies had the same 
effect, but at higher field strengths. 

Because the radio-frequency field should 
not affect a magnetic-particle-based sensor, 
the result suggests that insects use a radical- 
pair-based method of sensing magnetic fields. 


NANOTOXICOLOGY 
Lung penetration 


Nature Nanotechnol. doi:10.1038/nnano.2009.305 
(2009) 
When inhaled by mice, multiwalled carbon 
nanotubes (CNTs) can embed themselves in 
the lining of the lung (pictured below). 
James Bonner at North Carolina State 
University in Raleigh and his colleagues 
exposed mice to nanoparticle aerosols of 
either 30 milligrams per cubic metre or 
1 milligram per cubic metre for six hours. 
In the mice exposed to the higher level, 
immune cells called macrophages (pictured) 
engulfed the nanotubes and carried them to 
the lung lining. 
Within weeks of exposure, those mice
also developed a condition called subpleural
fibrosis, which causes localized fibrous lesions.
This work does not confirm the suggestion
made by other studies that nanotubes may
cause lung tumours, but the authors urge
caution when people are exposed to
nanotubes in the air.


CLIMATE CHANGE 


Stormy warming weather 


Geophys. Res. Lett. doi:10.1029/2009GL039810 
(2009) 

Arctic residents have complained that 
their weather is changing, and a study by 
researchers in Australia backs this up. 

Ian Simmonds and Kevin Keay of the 
University of Melbourne examined the 
relationship between the number and 
strength of cyclones each year in the Arctic 
basin and the extent of Arctic sea ice during 
the month of September for the past 30 years. 
They found that years with the least amount 
of ice had significantly stronger storms, 
although there was no correlation with the 
number of storms. 

The findings support previous forecasts 
that the decline in sea ice, and particularly the 
record lows of recent years, is raising the risk 
of stronger storms in the Arctic. 


CANCER BIOLOGY 


Double protection 


Sci. Transl. Med. 1, 3ra7 (2009)
Blocking a key cell-signalling pathway 
increases certain tumours’ sensitivity to high
doses of ionizing radiation while protecting 
healthy tissue from the harmful rays. 

David Roberts of the National Cancer 
Institute in Bethesda, Maryland, and his 
colleagues inhibited the cell-surface receptor
CD47 or the protein that binds it, TSP1,
which regulates cell growth and survival in 
response to stress, such as that caused by 
radiation. They found that suppressing the 
CD47-TSP1 pathway in normal human cells 
improved their survival after irradiation and, 
in mice, led to reduced radiation injury. 

In addition, the tumours of mice treated 
with a CD47-blocking molecule prior to 
radiation exposure were up to 89% smaller 
30 days after irradiation than those of mice 
receiving radiation alone. 


BIOPHYSICS 
All seeing eye 


Nature Photon. doi:10.1038/nphoton.2009.189 (2009) 
Polarized light is used in optical devices, 
including some microscopes. Being able to 
control polarized light is key. Materials such 
as crystals can do the job, but only within a 
limited range of wavelengths. 

Nicholas Roberts of the University of 
Bristol, UK, and his colleagues have worked 
out how a species of mantis shrimp can 
switch polarized light from one form to 
another over a range of colours. A thin band 
of specialized receptor cells in the eyes of 
Odontodactylus scyllarus have just the right 
structure, dimensions and composition to 
enable them to control polarization over most 
of the visible spectrum. The team believes 
that further study of this mantis shrimp’s eyes 
could lead to better optical devices. 


NEUROSCIENCE 


Brain signal source 


Proc. Natl Acad. Sci. USA doi:10.1073/pnas.0905509106 (2009)

Functional magnetic resonance imaging 
(fMRI), used to map brain activity, gives a 
signal when the levels of oxygenated blood 
increase. The signal is often preceded by a
darkening, thought to indicate early oxygen 
absorption by the brain owing to local 
neural activity. 

But a study by Aniruddha Das and his 
colleagues at Columbia University in New 
York casts doubt on this. They used intrinsic 
signal optical imaging, a technique similar to 
fMRI, to measure changes in blood volume 
and blood oxygenation in the brains of two 
macaques while they performed a visual task. 

The researchers found that during the 
initial darkening, blood-oxygen levels 
changed little, but blood volume increased 
markedly. The team suggests that blood- 
volume change is a better signal to use in 
brain imaging because it seems to be more 
closely linked to neural activity, occurring 
even before changes in blood oxygenation. 


SEXUAL SELECTION 


Intruder alert! 


Proc. R. Soc. B doi:10.1098/rspb.2009.1554 (2009) 
A male redback spider can sneak in and
quickly copulate with a female after a rival
male has already spent hours wooing her, yet
avoid the usual penalty of short courtship:
being eaten prematurely by his lover.

The female Australian redback spider 
(Latrodectus hasselti, pictured below) eats the 
male after mating, but sometimes consumes 
him prematurely — after he’s copulated only 
once. Jeffrey Stoltz and Maydianne Andrade of 
the University of Toronto in Canada measured 
the spiders’ courtship durations and found that 
females tend to eat their partners prematurely 
if courtship is less than 100 minutes long. 
However, intruder males can mate after a 
shorter courtship and avoid premature death 
if an earlier male had already exceeded this 
100-minute threshold. 

This could lead to lower quality males 
seeking out, rather than avoiding, competition 
with rival spider studs, the authors say. 

For a longer story on this research, see 
http://go.nature.com/U6DPEG 


ASTRONOMY 


Galaxy size matters 


Astrophys. J. 705, 255-260 (2009) 
A survey of distant galaxies shows that more 
loosely packed ones tend to form more stars. 
The survey looked at 225 galaxies at
distances of between about 2.8 billion and
3.4 billion parsecs from Earth. It found that
compact galaxies tend to have fewer new stars
than do their larger counterparts of comparable mass.
Sune Toft of the University of Copenhagen 
and his colleagues conclude that compact 
galaxies formed many stars quickly in one 
intense burst, early in the history of the 
Universe. Conversely, larger, more diffuse 
galaxies form stars gradually over a longer 
period of time. The results may explain why 
very distant galaxies are often more compact 
than the younger ones nearby.


JOURNAL CLUB


Jonathan Weissman 
University of California, San 
Francisco 


A biochemist looks at how DNA 
sequencing can reveal more than 
just sequences. 


Huge advances in DNA sequencing 
have allowed us to readily 
determine the sequence of almost 
any living (and a few extinct) 
species. Yet arguably, most 
biological insight comes from 
work on five model organisms: 
Escherichia coli, baker's yeast, 
roundworms, fruitflies and mice. 
Unfortunately, many important 
biological processes are not 
captured in these creatures. 

Papers from two groups, 
one led by Andrew Camilli 
of Tufts University in Boston, 
Massachusetts, the other by 
Brian Akerley at the University 
of Massachusetts in Worcester, 
describe new genetic tools that 
allow the quantitative dissection 
of gene function in a wide range of 
microorganisms (T. van Opijnen 
et al. Nature Methods 6, 767-772; 
2009; and J. D. Gawronski et al. 
Proc. Natl Acad. Sci. USA 106, 
16422-16427; 2009). These studies 
combine exhaustive transposon 
mutagenesis — whereby thousands 
of small DNA segments, or 
transposons, are introduced into 
the genome to mutate many 
genes — with massively parallel, or 
‘deep’ sequencing of transposon/ 
chromosome junctions to monitor 
the consequences of the loss of 
single or pairs of genes on the 
organisms’ traits.

The real power of the 
approaches comes from the 
deep sequencing, which tracks 
the abundance of individual 
transposon mutants after they 
have been subjected to a stress. 
Knowing by how much each 
mutant has grown or suffered 
under the stress provides a 
measure of the relative roles that 
the mutated genes have. 

I find it particularly gratifying
that the advances in deep 
sequencing that have allowed us 
to catalogue so many genes from 
so many organisms can now be 
harnessed to help us figure out 
what these genes actually do. 


Discuss this paper at http://blogs.nature.com/nature/journalclub


POLICY


Space-flight review: The US 
panel charged with reviewing 
NASA’s human space-flight 
programme issued its final report 
last week, and warned that the 
current programme seems to be 
“on an unsustainable trajectory”. 
Present funding doesn't match 
the space agency’s targets, says 
the commission, which is chaired 
by ex-aerospace executive 
Norman Augustine. Many of its 
suggestions, such as bypassing 
human exploration of the Moon 
and scrapping the Ares-I rocket 
in favour of commercial space 
flights, had already been aired in 
public meetings (see Nature 460, 
791; 2009). 


European research reform: 
The European Commission
has agreed that a top scientist
should lead the administrative
and managerial activities
of the European Research
Council (ERC), in place of the
commission’s current appointee,
economist Andreu Mas-Colell.
The decision, announced on
22 October, came in response
to a damning review of the ERC 
published in July (see Nature 
460, 557; 2009), which called 
for immediate reforms to the 
council’s management. Fotis 
Kafatos, president of the ERC, 
said that the commission’s 
response was welcome but “not 
particularly revolutionary”. See 
go.nature.com/Eh8n43 for more. 


Nuclear vision: Germany's new 
coalition government will extend 
the lifespan of the nation’s nuclear 
power plants — which last year 
produced around 23% of the 
country’s electricity needs — 
beyond 2022. But the Christian 
Democratic Union and its junior 
coalition partner, the liberal Free 
Democratic Party, will not revise 
an existing ban on building new 
nuclear plants. In a 24 October 
policy plan, the coalition also 
agreed to “immediately” lift a 
moratorium on evaluating the 
merits of the Gorleben salt dome, 
a controversial storage site for 
nuclear waste.


HWANG CONVICTED


Disgraced South Korean cloning scientist Woo Suk Hwang left Seoul Central District Court on 26 October knowing 
that his sentence, a two-year prison term suspended for three years, could have been worse. He was found guilty 
of embezzling government funds and buying human eggs in violation of the country's bioethics law, but was 
cleared of fraud. The prosecution had sought a four-year jail term, and plans to appeal. See page 1181 for more. 


Mercury deadline: The US 
Environmental Protection 
Agency (EPA) has agreed
to set new rules governing
emissions of mercury and other 
toxic chemicals from power 
plants by November 2011, 
according to a settlement in a 
federal lawsuit filed by several 
environmental and health 
groups. Environmentalists say 
that the Clean Air Act required 
the EPA to set limits by 2002, 
but the administration of former 
US President George W. Bush 
avoided this in part by creating a 
market-based system that would 
allow mercury emissions to 
continue at some plants as long 
as they dropped in aggregate. 
However, Bush’s work-around 
was later deemed illegal in 
federal court. 


Polar-bear protection: The
US Fish and Wildlife Service
proposed on 22 October to
designate around 500,000 square
kilometres of “critical habitat”,
96% of which is sea ice, for
the polar bear. The bear was
listed as a threatened species in
2008 owing to projections of
sea-ice declines caused by global
warming. The government
is already obligated to avoid
actions that jeopardize the bear,
but the designation would add
another layer of protection by
also making it illegal to conduct
activities that adversely affect the
bear’s habitat.


CRUNCH


of Americans think
there is “solid evidence
the Earth is warming”,
according to an October
2009 poll.


of Americans thought this
in April 2008.


Source: Polls conducted by
Washington-based Pew Research
Center for the People & the Press


Vaccine report: More children
than ever are being immunized,
but 24 million infants in
the world’s poorest nations
still do not receive routine
immunization, according to
a report by the World Health
Organization, UNICEF and the
World Bank. The 21 October
State of the World’s Vaccines and
Immunization report says that
although four in five children
now have access to life-saving
vaccines, at least another
US$1 billion is needed annually
to help raise immunization rates
above 90%. This would cover
the rising costs of immunization
and could prevent two million
children from dying per year
by 2015. See go.nature.com/APMPtB
for more.


GM protests: Environmental 
groups are protesting after
the Mexican government's
15 October approval of the first
permits to plant experimental 
genetically modified (GM) 
maize (corn). Growing of GM 
varieties had previously been 
outlawed in the country, which 
is the homeland of domesticated 
maize. Mexican environmental 
and agricultural agencies say 
that they will keep plantings 
away from traditional ‘landrace’ 
maize, and will monitor the 
experimental crops closely 
before considering requests 
from agribusiness for full-scale
GM maize planting. But
researchers say that past landrace 
contaminations from illegal GM 
maize planting (see Nature, 456, 
149; 2008) mean corruption of 
traditional genomes is inevitable. 


FUNDING


Energy funding: The US
Department of Energy has
awarded $151 million to 37
research projects through the
recently formed Advanced
Research Projects Agency-Energy
(ARPA-E). Based
on the Defense Advanced
Research Projects Agency,
ARPA-E is geared towards
high-risk ‘transformational’
energy research that might not
be funded through traditional
science grants. Awardees
included small businesses,
educational institutions and
large corporations that focus on
everything from liquid-metal
batteries and gasoline-producing
bacterial reactors to new methods
for making light-emitting diodes
and synthetic enzymes for
capturing carbon dioxide from
industrial emissions.


Large Hadron Collider


Physicists last week
injected particles into the
accelerator for the first
time since an accident
forced it to shut down in
September 2008.


Innovation fund: The United 
States is following up on 
promises to facilitate a global 
fund to trigger innovation and 
technology development. On
23 October the government's
Overseas Private Investment
Corporation issued a call
for proposals for a Global
Technology and Innovation 
Fund, aimed at countries in Asia, 
the Middle East and Africa. 
Each selected proposal would 
receive between $25 million and 
$150 million; potential areas 
include clean technology and 
information technology. 


RESEARCH


Scientific espionage: A former 
Los Alamos nuclear-weapons 
physicist says that he is under 
investigation for espionage.
The researcher, P. Leonardo
Mascheroni, spoke to the
Associated Press on 22 October,
two days after he says FBI
agents raided his home. The
bureau confirmed an “ongoing
investigation” into his activities.
Mascheroni, who worked
in the lab’s X Division in the
1980s, says that last year he gave
unclassified information,
widely available on the Internet,
to a man claiming to be from
the Venezuelan government 
who asked for information about 
starting a nuclear-weapons 
programme. 


HIV vaccine doubt: Results of
the largest-ever HIV-vaccine trial
looked less impressive when full
details were formally published
last week (S. Rerks-Ngarm et al.
N. Engl. J. Med. doi:10.1056/nejmoa0908492;
2009) than
when they were outlined in a
press release a month earlier. In
September, the trial was said to
show that a vaccine combination
reduced the risk of HIV infection
by nearly one-third. But Peter
Smith, a tropical epidemiologist
at the London School of Hygiene
& Tropical Medicine, says “there
is not much evidence from the
data that it protects at all”. See
page 1187 for more.


THE WEEK AHEAD


29 OCTOBER-1 NOVEMBER
Philadelphia hosts the 47th
Annual Meeting of the Infectious
Diseases Society of America.
> go.nature.com/ykfvNW


29-30 OCTOBER
A European Council summit
meeting in Brussels may firm
up European promises to
finance climate-change action in
developing countries.
> go.nature.com/1kWXLs


2 NOVEMBER
The European Space Agency
is scheduled to launch its Soil
Moisture and Ocean Salinity
satellite.
> go.nature.com/sHQ161


2-6 NOVEMBER
Nairobi, Kenya, hosts the
Multilateral Initiative on
Malaria's fifth Pan-African
Malaria Conference.
> www.mimalaria.org/pamc


2-6 NOVEMBER
The United Nations Framework
Convention on Climate
Change holds its fifth round of
international climate talks this
year in Barcelona, Spain.
> go.nature.com/QsS4jx


BUSINESS WATCH 


Beginning its rehabilitation after a dismal 
funding scene earlier this year, venture- 
capital financing in the United States saw 
a small improvement in the year's third 
quarter (to 30 September). It increased by 
17% over the previous quarter to $4.8 billion, 
according to a 20 October MoneyTree 
Report by PriceWaterhouseCoopers and the 
National Venture Capital Association, based 
on data from Thomson Reuters. 

The clean-technology sector, which 
spans industries such as alternative energy, 
conservation, pollution-scrubbing, recycling 
and power supply, was responsible for 
more than half of the rise. Venture-capital 


funding in this sector grew by 89% from the 
previous quarter to $898 million in 57 deals. 


A few large funding rounds fuelled this 
increase; three of the top ten deals of the 
third quarter went to clean-tech companies 
in California: $286 million to Solyndra of 
Fremont (photovoltaics), $82.5 million 

to Tesla Motors of San Carlos (electric 
vehicles) and $60 million to Serious 
Materials in Sunnyvale (energy-efficient 
building materials). 

Biotechnology continues to be the top 
funded sector, receiving $905 million in the 
third quarter — a 4% decrease from the 
second quarter. 


African science feels the pinch 


Recession dampens donors’ enthusiasm. 


DURBAN 

The global financial crisis is hampering plans 
to revive African science, researchers and 
policy-makers said last week in Durban, South 
Africa. Slashed donor funding, slowing foreign 
investment and competing budget priorities 
are the main culprits; hardest hit are the poor- 
est countries and continent-wide projects. 

“Our countries are in crisis, philanthropists 
are in crisis and the aid agencies are in crisis,” 
Jean-Pierre Ezin, commissioner for science, 
technology and human resources for the Afri- 
can Union, said at a conference organized by 
TWAS, the academy of sciences for the devel- 
oping world, based in Trieste, Italy. 

In 2007, an African presidential summit on 
science saw funders falling over each other 
to offer assistance on science and technology 
programmes. Today the funding situation has 
changed dramatically. 

The Swedish international development 
agency SIDA said last month that it would 
cut funding for its research cooperation 
programmes with developing countries by 
20%, from an estimated 1.05 billion kronor 
(US$150 million) to 800 million kronor. 
Britain’s Wellcome Trust, which funds several 
medical research projects in Africa, cut overall 
grants for 2008-09 by £30 million (US$50 mil- 
lion), to £590 million. Many people expect aid 
levels to fall this year as a result of the financial 
crunch, although the Organisation for 
Economic Co-operation and Devel- 
opment says effects are more likely 
to be felt in future years, as 2009 aid 
budgets were mostly finalized before 
the recession hit. 

Not all the news is bad. Several 
American philanthropists, including 
the Bill & Melinda Gates Foundation, 
said earlier this year that they would not cut 
research funding. The Gates Foundation even 
said that it would increase grants, despite a 20% 
drop in assets last year. 

Still, Ezin says that the African Union won't 
be able to fulfil all its planned science activi- 
ties for 2009 and 2010. Instead, his department 
will prioritize the Pan-African University, a 
network of existing African institutions that 
will train PhDs and carry out research, and a 
grant programme for African researchers. But 
a programme to train science teachers, he says, 
may have to be scaled back. 

Broken hopes: research projects between African universities and other countries are being cut back. 

University researchers are also feeling the 
pinch. In Senegal, a plan to expand the coun- 
try’s university system has come to a standstill. 
“The crisis came and everything stopped,” says 
Lamine Ndiaye, a former vice-chancellor of the 
University of Gaston Berger in Saint-Louis, 
Senegal. Funding has dried up from the gov- 
ernment and from France, the country’s main 
development partner, he says. 

Countries that don't depend on aid 
are also struggling. In Nigeria, the 
drop in demand for oil and gas, exac- 
erbated by a stricken banking sector, 
means that private donations — a 
major source of funding for Nigerian 
universities — are slowing. “In the 
past, a conference like this would have 
a lot of Nigerians coming, supported by indus- 
try grants. We don’t find many today,” says 
Oye Ibidapo-Obe, president of the Nigerian 
Academy of Science in Lagos. 

Nigeria's government won't pick up the slack 
left by the drop in private investments, Ibidapo- 
Obe adds. “Research is not seen as the major 
driver of the economy.” 

Even South Africa, the continent’s economic 
powerhouse, is facing a lean year. The country’s 
coffers have been depleted by its worst finan- 
cial performance since the end of apartheid 
15 years ago, says Naledi Pandor, the science 
minister. 

Pandor says she has been assured by the 
country’s treasury that her department won't 
face cuts in the mid-term budget, due for 
release as Nature went to press. But the dip 
in the country’s growth rate means that the 
department, which was given 4.2 billion rand 
(US$560 million) for 2009-10, may have to 
put some planned projects on the back burner. 
Probable cuts include a 700-million-rand semi- 
commercial titanium test facility, which could 
be either delayed or dropped completely. 

Recent progress in building up African 
science could be lost, warns Mohammed 
Hassan, executive director of TWAS. In the 
1980s, African governments responded to a 
steep economic decline by cutting their higher- 
education budgets, he notes, and Africa went 
from having some of the best universities in the 
developing world to some of the worst. 

Although the world economy is starting 
to recover, the worst may be yet to come for 
Africa, says John Muyonga, a food scientist 
at the University of Makerere in Uganda. His 
institution depends “close to 100%” on donor 
funding, he says. Most of his colleagues have 
grants that cover several years, and may strug- 
gle to find new funding when these grants run 
out. “We may have more impact in 2010 than 
in 2009,” he adds. 
Linda Nordling 


Q&A: A HEALTHY VISION 
Tom Frieden, director of 
the US Centers for Disease 
Control and Prevention, 
speaks to Nature. 
go.nature.com/Py9Srb 


Woo Suk Hwang convicted, but not of fraud 


Cloning pioneer Woo Suk Hwang 
was sentenced to two years in prison 
at the Seoul Central District Court on 
26 October, after being found guilty 
of embezzlement and bioethical 
violations but cleared of fraud. 
Supporters of Hwang, a former 
professor at Seoul National 
University in South Korea, were 
pleased with the sentence, which is 
suspended for three years and half 
the length sought by prosecutors. 
The prosecution plans to appeal. 
Hwang was once fêted for 
creating human stem-cell lines 
using cloned embryos derived from 
patients suffering from spinal-cord 
injury and other disorders (W. S. 
Hwang et al. Science 303, 1669- 
1674; 2004 and W. S. Hwang et al. 


Science 308, 1777-1783; 2005). The 
accomplishment, which promised 
an endless supply of stem cells 
genetically matched to patients, 
turned out to be bogus. 

Hwang admitted in January 2006 
to falsifying data, while maintaining 
that he had the ability to do what 
he had claimed. In South Korea, 
scientific fraud would be illegal only 
if Hwang had used fraudulent data 
to gain grants. Prosecutors argued 
that he duped two companies, 

SK Group and NongHyup, into 
supplying research funds. But 
according to media reports, the 
court rejected the allegations on 
the grounds that the firms provided 
money without expecting to benefit. 

The court did, however, find 


Hwang guilty of buying human 
eggs in violation of the country's 
bioethics law and of embezzling 
830 million won (US$700,000) of 
government money. 

The Korea Times reported that 
the light sentence was motivated 
by judge Ki-ryul Bae's sympathy 
for Hwang's apparent dedication 
to Korean biotechnology and his 
stated remorse. Hwang will now be 
able to focus on his research career, 
which he has been rebuilding since 
he was indicted in May 2006 (see 
Nature 461, 1035; 2009). 

Many researchers are not ready 
to welcome Hwang back. “It was 
not just one moment of weakness 
— the degree of manipulation of 
the goodwill of people, particularly 


fellow scientists, made it more,” 
says Alan Colman, a stem-cell 
scientist at the Institute of Medical 
Biology in Singapore. “The sad thing 
is that it's clear he is a talented 
experimentalist.” Colman argues 
that Hwang should not be eligible 
for research funding from public 
sources for a prolonged period. 

Researcher Ryuzo Torii of the 
Shiga University of Medical Science 
in Japan used large amounts of 
grant money, time and monkey 
eggs trying to reproduce Hwang's 
technique in non-human primates 
in 2004 and 2005. He says that 
forgiving Hwang and recognizing 
him as a researcher would be “a 
mistake”. 
David Cyranoski 


US physicists propose astrophysics goals 


Cosmic-ray experiments would suffer, 
unless budgets increased, under a list of 
priorities for high-energy physics given 
to the US Department of Energy (DOE) 
by independent advisers last week. 

The DOE should instead fund one 
massive dark-matter detector, one major 
dark-energy experiment and a high- 
energy gamma-ray detector, according 
to the report, which was presented on 
23 October in Washington DC. 

For current budget trends, the 
report did not endorse Auger North, a 
US$127-million array of 4,400 cosmic-ray 
detectors in southeastern Colorado. This 
is due partly to the success of its southern 
counterpart, the Pierre Auger Observatory 
in Argentina, which is close to tracing the 
origin of high-energy cosmic rays to massive 
black holes in galactic centres, says Steve 
Ritz, chair of the 15-person committee that 
wrote the report. “You no longer need to 
invoke new physics to explain [high-energy 
cosmic rays],” says Ritz, an astrophysicist at 
the University of California, Santa Cruz. 

The report is the first in which US 
high-energy physicists have set priorities 
for particle astrophysics. The committee 
considered four budget scenarios, ranging 
from the relatively grim DOE budget for 
the 2008 fiscal year in years going forwards, 
to budgets that would more than double 
over the next decade. The DOE takes the 
advice “very seriously”, says Dennis Kovar, 
the agency’s associate director of science for 
high-energy physics. 


Southern cosmic-ray observatory may not get a partner. 
PIERRE AUGER OBSERVATORY 


Even in the tightest scenario, dark-matter 
experiments fared well — both because 
they are relatively cheap and because there 
are hints that discovery of these undetected 
particles, which make up nearly a quarter 
of the Universe’s mass-energy, could be 
imminent. If budget trends continue, 
however, the report says that only one next- 
generation detector can be chosen from 
among competing groups, which propose 
ten-tonne vats of liquid argon or xenon, 
or cryogenic silicon detectors set deep 
underground. 

Dark energy — the mysterious vacuum 
force that makes up most of the rest of 
the Universe’s mass-energy — remains 
a priority, although the experiments 
are expensive (see page 1182). Unless 
budgets increase, the report suggests, 
the DOE will have to choose between 
supporting a space-based project, the 
Joint Dark Energy Mission, or a major 
ground-based telescope: either the 8.4- 
metre Large Synoptic Survey Telescope 
(LSST) or BigBOSS, a 4-metre telescope. 

“We don't have the luxury of a lot 
of redundancy,” says Patricia Burchat, 
a physicist at Stanford University in 
California and a member of the LSST 
project. “We can't afford it.” Ritz says that 
so far, the agency hasn't looked at the trade- 
offs between ground- and space-based 
approaches to dark energy. 

For high-energy gamma-ray astrophysics, 
the report supports the Advanced Gamma- 
ray Imaging System, an array of about 50 
telescopes spread over a square kilometre 
— but only if the consortium merges with a 
similar European proposal, the Cherenkov 
Telescope Array. 
Eric Hand 




NATURE|Vol 461/29 October 2009 


Measuring supernovae, such as 
this one (bright spot, bottom) 
in the galaxy NGC 4526, could 
help pin down dark energy. 


Dark energy rips cosmos and agencies 


An international space mission to study an astronomical mystery is foundering. 


A once-favoured space probe to study dark 
energy is struggling to get off the ground, as 
three agencies in the United States and Europe 
tussle over the details of a potential interna- 
tional mission. 

The rise and fall this year of the Joint Dark 
Energy Mission (JDEM) — a satellite meant to 
pin down the repulsive force that is accelerat- 
ing the Universe’s expansion — is partly due 
to strife between two US agencies, NASA and 
the Department of Energy (DOE), and a third 
potential partner, the European Space Agency 
(ESA). In addition, scientists working on the 
JDEM designs have not presented a unified 
front, owing to disagreements over the best 
observational method to use (see ‘Hunting for 
dark energy’) — at a time when an influential 
astrophysics panel is about to prioritize the next 
decade's best and most organized missions. 

“This is an example of a satellite blowing 
up before it gets built,” says Bob Nichol, an 
astrophysicist at the University of Portsmouth, 
UK, who is working on the European design 
concept. 

Dark energy is a fudge factor for a force 
that has the upper hand on gravity, pushing 
the Universe to accelerate at an ever-faster 
rate. The effect was announced in 1998 after 
astronomers precisely measured the distances 
to supernovae in other galaxies. But the cause 
remains baffling. “It is perhaps the biggest 
mystery of our time,” says Neil Gehrels, JDEM 
project scientist at the Goddard Space Flight 
Center in Greenbelt, Maryland. “It determines 
the fate of the Universe.” Depending on how 
strong it turns out to be, dark energy could 
dissipate the Universe into darkness, shred it 
in a ‘big rip’ or even reverse itself, allying with 
gravity to create a ‘big crunch’. 


JDEM was meant to figure this out, and the 
mission had early momentum. It got a political 
boost from a 2007 study by the National 
Research Council that ranked a dark-energy 
probe as the top priority for studying deep cos- 
mological questions. Then, to forge a scientific 
consensus, in September 2008 NASA and the 
DOE brought together three design teams that 
had been competing to fashion a mission that 
could potentially accommodate all of them 
(see Nature 455, 577; 2008). 


Hunting for dark energy 


Scientists are arguing over 
three space-based approaches 
to measure dark energy. 


Supernovae Measuring 
distances to supernovae was 
the first, and best-understood, 
way to probe dark energy, but 
also has the most limitations. 
It assumes that all supernovae 
in the early Universe exploded 
with the same characteristic 
brightness. 


Baryon acoustic oscillations 
Sound waves from the Big 
Bang imprinted ripples in 
the distribution of galaxies. 
By comparing the size of 

the ripples in early galaxy 
clusters with those in clusters 
that formed later in the 
Universe, astronomers can 
deduce the effect of dark 
energy. But finding enough 
young clusters big enough to 
work with is a challenge. 


Weak lensing This looks 
for tiny distortions in the 
shapes of galaxies caused 
by intervening dark matter, 
which varies with cosmic 
time and relates to dark 
energy. It requires images of 
many galaxies from surveys, 
which means that such 

a mission would need to 
operate differently from one 
optimized for baryon acoustic 
oscillations. 


E.H. 


NASA, ESA, HUBBLE KEY PROJECT TEAM, HIGH-Z SUPERNOVA SEARCH TEAM 


Two months later, the agencies inked an 
agreement in which NASA would lead the 
mission. The DOE has said it would contribute 
roughly one-quarter of the cost. 

But price was a problem. Separately, each 
of the three teams had missions with price 
tags of more than US$1 billion; combined, 
the JDEM was just as expensive, if not more 
so. NASA's astrophysics division had always 
wanted something smaller and cheaper. But 
“how much dark energy can you buy for $600 
million?” asked Jon Morse, director of NASA's 
astrophysics division, at an advisory meeting 
this month. “No one seems to want to answer 
that question.” 

So NASA went to ESA, which had been pur- 
suing its approach, called Euclid, for years. By 
January, it looked as if ESA — and its budg- 
etary resources — were on board (see Nature 
doi:10.1038/news.2009.38; 2009). 


Downgraded 

But when NASA told DOE officials that 
the Europeans would be building scientific 
instruments that the DOE had planned to 
make itself, the energy agency baulked. The 
DOE-NASA agreement specifies that the 
DOE would build a “major scientific instru- 
ment”. Yet in negotiations through the win- 
ter, the department was offered 
minor-league work that some 
there considered insulting. 

“I don’t disagree that there 
were unhappy scientists on all 
three sides,” says Gehrels. But no division of 
duties was set in stone then, he says. And he 
points out that very few multi-agency missions 
are smooth; for instance, the $700-million 
Fermi Gamma-ray Space Telescope, with 
contributions from the DOE and NASA, 
went through a difficult birthing process for 
more than a decade before being launched last 
year. At a physics advisory meeting last week 
in Washington DC, William Brinkman, the 
DOE’s director for the Office of Science, said of 
JDEM: “It’s not an uncontroversial situation.” 

DOE officials and scientists were so unhappy 
that, for a few months this spring, they broke off 
from the mission. Some, at the Lawrence Berke- 
ley National Laboratory in California, instead 
proposed a ground-based dark-energy survey 
called BigBOSS. In presentations in June, the 
team claimed that BigBOSS would rival JDEM’s 
results for an overall cost of just $85 million. 

Meanwhile, NASA and ESA ploughed ahead 
with a design that became known as the Interna- 
tional Dark Energy Cosmology Survey (IDECS). 
In April, this partnership also ended because 
the merger wasn't happening quickly enough. 
In Europe, the Euclid team was told to resume 
work on its 1.2-metre space telescope, Nichol 
says, in preparation for a triage in February 2010 
under ESA’s ‘Cosmic Visions’ competition. 

On the US side, three design concepts 
remain in play. The $1.6-billion IDECS design 
has two types of detectors and would use all 
three observational methods on a 1.5-metre 
telescope. A $1.2-billion ‘Omega’ design would 
use only one detector and sacrifice some capa- 
bilities. Gehrels says that his team has now been 
tasked to look at a third concept, with a price 
cap of $850 million, that will require a smaller 
telescope and cut the probe's capabilities. 

Meanwhile, as the space-based mission ideas 
stall, ground-based astronomers are making 
strides in pinning down uncertainties in a 
key dark-energy parameter that determines 
whether the force is a constant or changes with 
time. One of many recent studies, which added 
more than 100 newly observed supernovae to 
existing studies, suggests that it is a constant 
with a precision of 7% (M. Hicken et al. Astro- 
phys. J. 700, 1097-1140; 2009). 

Simon White, director of the Max Planck 
Institute for Astrophysics in Garching, Ger- 
many, questions whether it is worthwhile to 
spend a billion dollars just to show, with more 
precision, that dark energy is a constant. “There's 
nothing to look for,” says White. “It's fairly close 
to a cosmological constant, and the question [for 
JDEM] is: is it very, very close to 
a cosmological constant?” 


But all the recent studies 
that seem to be homing in on 
a cosmological constant make 
the assumption that it is constant in time, says 
Rocky Kolb, a theorist at the University of Chi- 
cago in Illinois. 

To achieve precision and to test how dark 
energy has varied in time, Kolb says, going to 
space is necessary. Both possibilities — a con- 
stant, or a changing dark energy — are equally 
ugly to theorists, who have few explanations for 
either scenario. 

Kolb notes that there is a third possible result. 
Several ground-based microwave telescopes, 
such as the South Pole Telescope, are track- 
ing how the structure of very distant galaxy 
clusters grew in the early Universe under the 
influence of gravity. If these results, expected 
soon, do not agree with the measurements of 
the expansion history of space — the measure- 
ments made by methods used by JDEM — it 
would indicate that something is wrong with 
general relativity. 

But JDEM may not get to make these com- 
parisons. NASA and the DOE will not commit 
to any of the designs until after the community 
survey weighs in in spring 2010. “We'll wait 
for the decadal survey,” Morse said earlier this 
month, “to see if we have a mission.” 
Eric Hand 


Ozone protocol squares up to climate 


Europeans back efforts to amend the Montreal Protocol to address global warming. 


As climate observers fret over December's 
global-warming summit in Copenhagen, 
international ozone negotiators are quietly 
plugging away on a proposed amendment to 
the Montreal Protocol on ozone-depleting 
substances — one that could reshape the 
way governments tackle an important class 
of greenhouse gas. 

Delegates to the protocol will meet in Port 
Ghalib, Egypt, on 4-8 November to discuss 
proposals to reduce the use of hydrofluoro- 
carbons (HFCs), common refrigerants that 
were deployed to replace ozone-destroying 
chemicals such as chlorofluorocarbons. 
Although HFCs do not damage the ozone 
layer, some are up to 12,000 times more 
effective than carbon dioxide at trapping 
heat. Advocates of the proposals say that 
HFCs could be efficiently regulated under 
the Montreal Protocol — which has proved 
effective in cutting the use of ozone-depleting 
substances — to help slow global warming. 

European Union (EU) negotiators 
endorsed the idea during the United Nations 
climate talks in Bangkok earlier this month, 
adding momentum to a movement led by 
the small island states of Mauritius and 
Micronesia, which are seeking aggressive 
action on climate change to stave off rising 
sea levels. Those states submitted an initial 
amendment to the protocol in April, followed 
last month by a proposal from the United 
States, Canada and Mexico. 

Nonetheless, it is unclear whether Montreal 
delegates will move forward with such a decision 
when they meet in Egypt. European officials 
remain wary of letting the Montreal process get 
too far ahead of the Copenhagen climate talks 
(see ‘On the global-warming front’), and the 
idea has yet to garner endorsement from emerg- 
ing economies such as China and India, which 
represent the primary growth market for HFCs 
owing to rising demand for air conditioning. 

“I could envision something that says, ‘We 
are willing and interested in the possibility 
and would be prepared to take this on if so 
instructed’,” says Ana Maria Kleymeyer, 
Argentina's former lead negotiator on the 
protocol, who now works at the Inter- 
national Centre for Trade and Sustainable 
Development in Geneva, Switzerland. 


A real air con: local cooling, but global warming. 


Montreal delegates first ventured 
into global warming in 2007 when they 
agreed to accelerate the phase-out of 
hydrochlorofluorocarbons to reduce the 
gases’ greenhouse effects. Regulating HFCs, 
however, would require an explicit expan- 
sion of the Montreal treaty. 

Language proposed by European climate 
negotiators would allow their Montreal coun- 
terparts to move forward on HFCs as long as 
any greenhouse-gas reductions achieved by 
decreasing HFCs could count towards com- 
mitments under the UN climate treaty. 

“The Europeans just want to make sure 
that whatever is done on the Montreal Pro- 
tocol side is credited on the climate side,” 
says Durwood Zaelke, president of the 
Institute for Governance & Sustainable 
Development in Washington DC. He says 
one possibility is that EU negotiators in 
Egypt could agree to set global HFC emis- 
sion standards for vehicles, consistent with 
current EU domestic regulations. 

The European Commission says it wants to 
resolve how to coordinate financing, account- 
ing and regulations between the Kyoto and 
Montreal treaties before moving forward. The 
commission is calling on a technical commit- 
tee established under Montreal to analyse the 
proposals by the middle of 2010. 
Jeff Tollefson 


On the global-warming front 


As ozone negotiators look for 
ways to help battle global warming 
(see above), the international 
representatives who work solely 
on climate are inching along 
towards their own deal. 

Last week in Luxembourg, 
European Union (EU) 
environment ministers fleshed out 
their negotiating position for the 
climate summit in Copenhagen 
this December, but finance 
ministers were unable to agree 
on how much money the EU will 
commit to developing 
countries for adaptation 
and mitigation to 
climate change. 

The environment 
ministers endorsed 
earlier commitments to 
cut emissions by 20% 
by 2020 compared 
to 1990 levels, and by 
30% if there is a global treaty; 
they also said that emissions from 
the aviation and maritime sectors 
should be curbed by 10% and 20%, 
respectively, below 
2005 levels by 2020. 
European heads of 
state are expected 
to approve these 
recommendations 
and to tackle the issue 
of climate financing 
when they meet in 
Brussels this week. 

The European Commission has 
estimated that, in total, developing 
countries will need €100 billion 
(US$150 billion) annually to begin 
developing low-carbon economies 
to cope with the impacts of 
climate change. 

In the United States, meanwhile, 
Senate Democrats have begun 
work on a bill to curb US emissions 
by 20% by 2020 and by 83% 
by 2050, compared with 2005 
levels. The Senate Environment 
and Public Works Committee is 
hearing testimony on the bill this 
week, to be followed “as soon as 
possible” by amendments and a 
final vote. 


J.T. 


G. BAKER/AP 


FATAL FROG FUNGAL 
DISEASE FIGURED OUT 
Electrolyte imbalance stops 
amphibians’ hearts. 
go.nature.com/BOUiwd 


University tightens oversight of sensitive research 


University administrators are looking 
to sharpen their monitoring of export 
violations, officials said last week at 

a meeting of the National Council of 
University Research Administrators in 
Washington DC. 

The move comes in the wake of the first 
US conviction, last year, of a university 
professor for trafficking military-sensitive 
information. In July, John Reece Roth, 
formerly an engineer at the University 
of Tennessee, Knoxville, was sentenced 
to four years in prison for breaching the 
Arms Export Control Act; he remains 
free pending an appeal. Roth had shared 
sensitive information relating to a plasma- 
guidance system for unmanned aircraft with 
a graduate student from Iran and another 
from China. The case has triggered anxiety 
among many academics, who fear that they 
could be punished for unintentional slips 
(see Nature 461, 156; 2009). 


“Now that the faculty members know the 
facts of the Roth case, they don’t want to 
be individually challenged; they want the 
support of the university,” says David Brady, 
director of export and secure research 
compliance at Virginia Polytechnic Institute 
in Blacksburg. 

Roth’s project “slipped 
through the cracks” of the 
university's monitoring 
system, says Robin 
Witherspoon, export-control 
officer for the University 
of Tennessee's office of 
compliance. Witherspoon says that the 
university now has electronic flagging 
systems in place to alert her office to 
suspicious financial- or travel-related 
dealings with other countries, as well as any 
potentially problematic grant proposals or 
contracts. Witherspoon has also instigated 
training programmes to educate researchers 
about export control of sensitive data or 
technology. 

Christopher Golomb, a special agent 
with the FBI counterintelligence division 
in Washington DC, says that in light of the 
Roth conviction, the bureau is also adjusting 
the ways that it interacts with 
academic institutions. Last 
year, the agency conducted 
a survey with the Federation 
of American Scientists to 
assess negative views of law 
enforcement held by some in 
the scientific community. The 
FBI is also engaged in two academic alliances 
with university and college presidents. “It’s a 
great lesson learned,” says Golomb. “It helps 
us develop policies and procedures so that 
professors know that they can be targeted.” 

The White House has ordered a review of 
current US export control regulations. 
Elie Dolgin 


Q&A: MAKING A 
CELLULAR MENAGERIE 
Caroline Kane talks about a 
new image library of the cell. 
go.nature.com/N9duow 


Jury still out on HIV vaccine results 


Some experts see hope in trial findings, but others say that the data do not back up such optimism. 


PARIS 

More than 1,000 researchers in Paris 
last week rapturously applauded 
formal results from the largest-ever 
HIV vaccine trial. In a preliminary 
announcement in September, the 
trial, which included 16,402 peo- 
ple in Thailand, was said to show 
that a vaccine combination reduced 
the risk of HIV infection by nearly 
one-third (see Nature doi:10.1038/ 
news.2009.947; 2009). 

But some scientists are sceptical, 
arguing that the response of the HIV 
research community, long deprived 
of any good news from vaccine tri- 
als, is based more on hope than on 
rigorous science. 


Infection rates dropped slightly during a large HIV-vaccine trial. 


to the vaccine schedule — is the most 
valid, says Adel Mahmoud, former 
president of Merck Vaccines and now 
a molecular biologist at Princeton 
University in New Jersey. 

“The results of this trial should be 
treated with caution and some scep- 
ticism,” says Tim Peto, a researcher in 
tropical diseases and clinical medi- 
cine at the University of Oxford, UK. 
“My view is that a more balanced 
interpretation of the data is that the 
results show some evidence that the 
vaccine might be effective and that, 
unlike previous vaccine studies, this 
study cannot clearly rule out that the 
vaccine is ineffective.” 

Nelson Michael, a researcher at 


The US$119-million phase II trial, 
sponsored by the health ministry of Thailand 
and the US Army, started in 2003. Half of the 
recipients served as a control group; the other 
half were given four shots of ALVAC-HIV, an 
attenuated canarypox virus carrying HIV genes, 
and two shots of AIDSVAX, arecombinant form 
of the gp120 HIV surface protein. The trial’s 
results were published on 20 October to coin- 
cide with the AIDS vaccine meeting in Paris (S. 
Rerks-Ngarm et al. N. Engl. J. Med. doi:10.1056/ 
NEJMoa0908492; 2009). 

The results are a “milestone in HIV vaccine 
research’, says first author Supachai Rerks- 
Ngarm, of the Thai Ministry of Public Health. 

“Because the history of preventive inter- 
ventions against HIV has been so poor, the 
HIV research community has seized on this,’ 
counters Peter Smith, a tropical epidemiologist 
at the London School of Hygiene & Tropical 
Medicine. “There is not much evidence from 
the data that it protects at all” 

The trial was set up to measure the number 
of people in each group who became infected 
with HIV, and the amount of the virus that was 
circulating in the blood (the viral load) of those 
who became infected during the trial. 


The teams analysed infection rates in three 
ways (see table). Two of those methods (known 
as intention to treat (ITT) and per-protocol) 
found a 26% drop — not statistically signifi- 
cant — in infection rates in the vaccine group 
compared with the control group. The third 
method (mITT), the only one presented on 
24 September, excluded participants who had 
contracted HIV between the time they enrolled 
in the trial and their first vaccination. In this 
analysis, the reduction between the vaccine 
and control group infections was 31%, which 
just scraped into statistical significance. 

This showed that statistical significance was 
highly dependent on whether very small num- 
bers of individuals were excluded from either 
group. The small numbers of infected individu- 
als in the trial — 132 across both groups — also 
meant that none of the subgroup analyses was 
statistically significant. 

The ITT analysis is generally considered the 
main yardstick of the outcome of drug clinical 
trials, although an mITT analysis is acceptable 
ifagreed by independent experts. For vaccine 
trials, the per-protocol analysis — in which 
patients are excluded if they don't strictly adhere 


ONE TRIAL, THREE METHODS, THREE RESULTS 


the Walter Reed Army Institute of 
Research in Silver Spring, Maryland, and direc- 
tor of the US Military HIV Research Program, 
which co-organized the trial, argues that includ- 
ing people who didnt stick to their shots reflects 
a real-world vaccination scenario, and that 
excluding those who were HIV-positive before 
the trial began was justified. 

The trial also failed to detect any difference 
between the viral load in the two cohorts. 
Mahmoud calls this “very, very disturbing’, 
because an effective vaccine would be expected 
to at least reduce the viral load in people who 
become infected. However, Michael suggests 
that this finding might still prompt new vaccine 
leads, and that scientists should see whether 
they can uncover the immunological origin of 
any possible vaccine protection. 

Dan Barouch, a HIV vaccine researcher at 
Harvard University in Cambridge, Massa- 
chusetts, says that the results are ultimately 
positive. “We don’t understand why we saw 
the protection that we did, and the results are 
only modest, but nevertheless the Thai trial 
provides the first evidence of vaccine protec- 
tion in humans,’ he says. 

“Everyone is saying let's try to have hope, and 
this is a hope that the results mean something,” 
says Mahmoud. “But raising expectations with 
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Method of analysis Number of patients  Patientswhobecameinfected Reduction in infection no fundamental scientific base is dangerous.” 
rate in treated vs control group Declan Butler 

Intention to treat 16,402 56 vs 76 26.4% See Editorial, page 1174. 

Per protocol* Vase 36 vs 50 26.2% Correction 

Modified intention to treat} 16,395 51 vs 74 31.2% Tye ews [estore Scaling He wall sale 


461, 586-589; 2009) incorrectly located the 
University of Szeged in Budapest, Hungary. It is, 
of course, in Szeged. 


*Analysis excluded participants who failed to strictly adhere to the trial protocols, such as the calendar of vaccinations. +Analysis 
excluded participants who contracted HIV between the time they enrolled in the trial and their first vaccination. 
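As a rough check on the table, the crude arithmetic can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the trial's own analysis: it treats the two arms as equal in size (the article says only that half of the recipients were controls), it uses a simple two-proportion z-test, and the function names are invented here. The crude reductions therefore differ slightly from the published percentages.

```python
import math

def crude_reduction(infected_vaccine, infected_control):
    """Relative drop in infections, assuming equal-sized arms."""
    return 1 - infected_vaccine / infected_control

def two_sided_p(infected_vaccine, infected_control, total):
    """Two-proportion z-test p-value, assuming each arm holds total / 2 people."""
    n = total / 2
    pooled = (infected_vaccine + infected_control) / total
    se = math.sqrt(pooled * (1 - pooled) * 2 / n)
    z = (infected_control - infected_vaccine) / n / se
    return math.erfc(abs(z) / math.sqrt(2))

# Counts taken from the table. The per-protocol row is left out because
# its patient total is not legible in the source.
analyses = {
    "intention to treat": (56, 76, 16402),
    "modified intention to treat": (51, 74, 16395),
}

for name, (vaccine, control, total) in analyses.items():
    print(f"{name}: crude reduction {crude_reduction(vaccine, control):.1%}, "
          f"approximate p = {two_sided_p(vaccine, control, total):.2f}")
```

Under these assumptions the intention-to-treat comparison falls above the conventional 0.05 threshold and the modified analysis just below it, which matches the pattern described in the article: moving a handful of infections in or out of the count is enough to cross the line.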
RISK SCHOOL


Can the general public learn to evaluate risks accurately, or do authorities need to steer it 
towards correct decisions? Michael Bond talks to the two opposing camps. 


A group of eight-year-olds sits around
a classroom table, playing with col- 
oured, plastic boxes called tinker- 
cubes and linking them into chains. It 
could be playtime at almost any primary school 
in the world. But in this classroom, located in 
Stuttgart, Germany, the ‘toys’ are actually giving 
the children their first lesson in probabilistic 
reasoning. The cubes represent the children’s 
attributes — red cubes for girls, blue for boys; 
a yellow cube attached to a red cube for a girl 
with glasses, a green cube attached to a blue for 
a boy without glasses. The students end up with 
a symbolic representation of their classmates as
a group. And by collecting the cubes in various 
bins — boys versus girls, glasses versus non- 
glasses and so on — they begin to get a feel for 
the probability that, say, a boy will wear glasses 
or that a girl will not. It is play that is not quite 
play — yet the children seem hooked. 
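The bin-sorting exercise amounts to counting conditional frequencies. A minimal sketch, assuming a made-up class of ten children (the data and the function name `chance` are hypothetical, not from the Stuttgart classroom):

```python
from collections import Counter

# Hypothetical class: each child is a (sex, wears_glasses) pair,
# playing the role of a two-cube chain.
classmates = [
    ("girl", True), ("girl", False), ("girl", False), ("girl", False),
    ("boy", True), ("boy", True), ("boy", False), ("boy", False),
    ("boy", False), ("boy", False),
]

def chance(sex, wears_glasses, chains):
    """Share of children of the given sex whose glasses cube matches."""
    bins = Counter(chains)  # sort the chains into bins, as the children do
    same_sex = bins[(sex, True)] + bins[(sex, False)]
    return bins[(sex, wears_glasses)] / same_sex

print(f"Boy wears glasses: {chance('boy', True, classmates):.2f}")
print(f"Girl does not wear glasses: {chance('girl', False, classmates):.2f}")
```

The point of the sketch is that the probability is read straight off the bins: the number of matching chains divided by the number of chains of that sex.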
Eight might seem a little young to be learning 
a branch of mathematics that many students 
struggle to master in high school. But the idea 
behind the exercise — an experiment devised 
in 2005 by Elke Kurz-Milcke at the Institute of 
Mathematics and Computing in Ludwigsburg,
Germany, and tested in a number of German
schools — is that earlier is better. Teaching 
schoolchildren how to deal with frequencies 
and probabilities helps to prepare them for the 
complexities and uncertainties of the modern 
world, and will help them make sound deci- 
sions throughout their lives. 

That’s a view strongly endorsed by Gerd 
Gigerenzer, a psychologist 
at the Max Planck Institute 
for Human Development in 
Berlin and a frequent collab- 
orator with Kurz-Milcke. “At 
the beginning of the twenty- 
first century, nearly every- 
one living in an industrial 
society had been taught reading and writing 
but not how to understand information about 
risks and uncertainties in our technological 
world,” he says. Earlier this year, Gigerenzer 
set up the Harding Center for Risk Literacy at 
the Max Planck Institute to try to remedy this 
situation. Funded for an initial five or six years 
by a €1.5-million (US$2.2-million) grant from 
David Harding, managing director of the Lon- 
don-based investment-banking firm Winton 
Capital and a teacher of risk communication at
the University of Cambridge, UK, Gigerenzer 
and his team of five scientists have a twofold 
aim. First is to do basic research on how people 
perceive risk and second is to improve people’s 
statistical and decision-making skills through 
education programmes. 

Indeed, Gigerenzer is an outspoken advocate 
for the idea that people can 
be taught to improve their 
decision-making skills and 
has taken it upon himself 
to organize other research- 
ers and set up projects. But 
this idea is considerably 
more controversial than it
might seem. “There is a serious division in the
research community,” says Dan Kahan, who 
studies risk perception at Yale Law School in 
New Haven, Connecticut. He points out that 
many specialists in the field conclude from 
existing research that the public will never 
really be capable of making the best decision 
on the basis of the available scientific informa- 
tion. Therefore, he says, “risk decision-mak- 
ing should be concentrated to an even greater 
extent in politically insulated expert agencies”.
Those agencies, in turn, should guide or ‘nudge’ 
people into better decisions by presenting 
information more appropriately. 

One thing both sides agree on is that poor 
decision-making is ubiquitous and has a seri- 
ous effect on people's well-being. Faced with 
an unfamiliar or emotion-fraught situation, 
most people suspend their powers of reason- 
ing and go with an instinctive reaction that will 
often lead them astray. Witness the widespread 
fears in the United Kingdom and the United 
States over the past 10 years over links between 
autism and the measles-mumps-rubella vac- 
cine. Despite the lack of convincing evidence 
for such an association, many parents have 
chosen not to have their children vaccinated, 
leading to a rise in cases of potentially lethal 
measles. Likewise, a warning by the UK Com- 
mittee on Safety of Medicines in 1995 that the 
third-generation contraceptive pill increased 
the risk of dangerous blood clots by 100% was 
followed by an additional 13,000 abortions the 
next year, many of them in teenage girls. The 
fact that the increased risk amounted to just an 
extra 1 in 7,000 was lost on most people — and, 
crucially, was not passed on by the media. 
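The gap between the two ways of stating the pill risk can be made concrete with a short calculation. This is a sketch using only the two figures quoted above (a 100% increase and an extra 1 in 7,000); the variable and function names are illustrative.

```python
from fractions import Fraction

# Figures quoted in the text: risk doubled (+100%), extra risk of 1 in 7,000.
relative_increase = Fraction(1)          # +100%
absolute_increase = Fraction(1, 7000)    # extra cases per woman

# If doubling the risk adds 1 in 7,000, the baseline must have been 1 in 7,000.
baseline = absolute_increase / relative_increase
new_risk = baseline + absolute_increase

def per_7000(risk):
    """Express a risk as expected cases among 7,000 women."""
    return risk * 7000

print(f"Before: {per_7000(baseline)} in 7,000; after: {per_7000(new_risk)} in 7,000")
```

Stated as "from 1 to 2 in 7,000" the same information reads very differently from "a 100% increase", which is the framing problem the paragraph describes.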

Exaggerated risk judgements also make 
themselves felt on environmental issues. 
Examples include persistent fears over the 
dangers of genetically modified crops in 
Europe, despite studies showing that the risks 
are considerably lower than the scare stories 
allege, and the hysteria triggered in the United 
States during the late 1980s by reports — argu- 
ably overblown and still controversial — that 
the plant growth regulator daminozide (Alar), 
used on apples and other fruit, was a potent 
human carcinogen. “Exaggerated risk judge- 
ments can lead to anxiety that degrades qual- 
ity of life and causes excessive vigilance and 
self-protective behaviours,” warns Ellen Peters
of Decision Research, a non-profit group in 
Eugene, Oregon, that investigates human 
judgement and decision-making. 


Top down 

Even those who might be expected to know 
better — doctors, medical journalists or finan- 
cial speculators, for example — often fall into 
the same traps as everyone else. In one experi- 
ment, Gigerenzer asked 160 gynaecologists to 
interpret some basic statistics about a woman's 
chances of having breast cancer, given that her 
mammography screening had come back positive.
Just 21% gave the right answer¹.
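The kind of calculation the gynaecologists were asked to make can be sketched with natural frequencies, that is, by counting people rather than multiplying probabilities. The inputs below (1% prevalence, 90% sensitivity, 9% false-positive rate) are assumed for illustration only and are not given in the article; the function name is likewise hypothetical.

```python
def positive_predictive_value(prevalence, sensitivity, false_positive_rate, population=1000):
    """Share of positive tests that are true positives, via natural frequencies."""
    with_cancer = population * prevalence
    without_cancer = population - with_cancer
    true_positives = with_cancer * sensitivity
    false_positives = without_cancer * false_positive_rate
    return true_positives / (true_positives + false_positives)

# Illustrative inputs only (assumed, not taken from the article).
ppv = positive_predictive_value(prevalence=0.01, sensitivity=0.90, false_positive_rate=0.09)
print(f"Chance of cancer after a positive mammogram: {ppv:.1%}")
```

With these assumed inputs, about 9 of every 1,000 women screened are true positives and about 89 are false positives, so a positive result corresponds to a chance of cancer of roughly one in ten, far lower than intuition tends to suggest.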

Genetically modified organisms (GMOs) have become the target of exaggerated safety fears.

“Our ability to de-bias people is quite limited,”
says Richard Thaler, director of the
Center for Decision Research at the University
of Chicago in Illinois. Thaler teaches a
course in decision-making to MBA students
in their final quarter at the university's business
school. Even though the students should have 
picked up a lot about statistics and decision- 
making by this time, when tested at the start 
of his course they exhibit all the same biases 
found in other groups, says Thaler. “After ten 
weeks of my course they do learn a bit,” he says, 
“but I hardly turn them into rational economic 
decision-makers.” 

Initial concerns caused many parents to deny
their children the potentially life-saving measles-mumps-rubella
vaccination.

The problem, as many researchers in
cognitive neuroscience and psychology have
concluded, is that people use two main brain
systems to make decisions. One is instinctive 
— it operates below the level of conscious con- 
trol and is often driven by emotions. The other 
is conscious and rational. The first system is 
automatic, quick and highly effective in situ- 
ations such as walking along a crowded pave- 
ment, which requires the near-instantaneous 
integration of complex information and the 
carrying out of well-practised action. The sec- 
ond system is more useful in novel situations 
such as deciding on a savings plan, which calls 
for deliberative analysis. 

Unfortunately, the first system has a way 
of kicking in even when deliberation would 
serve best. Consider a well-known example: a 
bat and a ball cost $1.10 in total, the bat costs 
a dollar more than the ball, so how much does 
the ball cost? When Shane Frederick at the 
Massachusetts Institute of Technology in Cam- 
bridge analysed the responses to this question 
by nearly 3,500 individuals at eight American 
universities, less than half gave the right answer 
(5 cents)². Intuition suggests that the answer is
10 cents (it seems to fit and it feels right), and 
the rational system does little to correct this 
unless a conscious effort is made to intervene. 
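The deliberate route to the answer is a one-line piece of algebra: if the ball costs x, then x + (x + 1.00) = 1.10. A minimal sketch in whole cents (the function names are illustrative):

```python
TOTAL_CENTS = 110        # bat + ball
DIFFERENCE_CENTS = 100   # the bat costs this much more than the ball

def ball_price(total, difference):
    """Solve ball + (ball + difference) = total for the ball's price."""
    return (total - difference) // 2

def satisfies_puzzle(ball, total, difference):
    """Build the bat price from the difference, then check the total."""
    bat = ball + difference
    return bat + ball == total

ball = ball_price(TOTAL_CENTS, DIFFERENCE_CENTS)
print(f"Ball: {ball} cents")
print(f"Does the intuitive 10 cents fit? {satisfies_puzzle(10, TOTAL_CENTS, DIFFERENCE_CENTS)}")
```

The intuitive guess of 10 cents fails the check because it makes the bat $1.10 and the total $1.20.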

Gerd Gigerenzer thinks that an early education in statistics will go a long way towards helping children
to deal with life's uncertainties.

Such findings are why many researchers
think that attempts to improve decision-making
through education, which tries to put the
rational system in charge of the instinctive one,
lie somewhere between over-optimistic and
hopeless. Two of the most prominent sceptics
are Thaler and Cass Sunstein, a professor
at Harvard Law School who heads the White
House's Office of Information and Regulatory 
Affairs. Thaler and Sunstein’s 2008 book Nudge 
(Yale University Press) urges governments and 
institutions to steer people’s choices in ways 
that should improve their lives — an approach 
Thaler and Sunstein call “libertarian paternal- 
ism”. Examples include automatically enroll- 
ing people into organ-donation schemes and 
pension plans unless they specifically choose to 
opt out (rather than the default being that they 
are not enrolled, then asking them to opt in); 
dollar-a-day programmes to reduce teenage 
pregnancies (girls receive a dollar for each day 
they are not pregnant); and the use of software 
recognition to delay the transmission of angry 
e-mails, giving people the option to delete 
before sending. In general, the idea behind the 
‘nudge’ approach is to shape incentives 
and present information in a way that 
increases the chances that people will 
exercise good judgement. 

Gigerenzer has no problem with 
improving the way that the informa- 
tion is presented. He points out that 
health statistics are often framed in 
ways that confuse not only patients 
but doctors, too. His Harding Center 
is collaborating with health insurers 
in Germany to persuade authorities 
to present health information more 
transparently, and he has convinced a 
German medical association to rewrite 
one of its brochures to achieve the same kind
of clarification.

By giving people the chance to opt
out of schemes, rather than opting in,
governments can push people to make the
decisions they think are right for society.

But Gigerenzer is critical of those who push
the nudge approach exclusively and essentially
give up on people's ability to learn and reason
for themselves. Some people, he
says, like to attribute every poor
decision to hard-wired mental
processes that humans cannot
control. He maintains that there is
plenty of evidence that people can
learn to rewire their minds — or at least, that
they can learn cognitive tricks that help them
to recognize and compensate for their biases.
Back in the 1980s, for example, Richard Nisbett
at the University of Michigan in Ann Arbor
and his colleagues found that half an hour's
training in statistical reasoning significantly
improved people’s ability to rationalize
everyday problems³. That included problems
not generally thought of in terms of probabili- 
ties, such as whether a group’s performance can 
be predicted from the performance of one or 
two of its members, or how to infer someone's 
personality from first impressions. 

Gigerenzer’s optimism about education 
finds cautious support from Daniel Kahne- 
man, a senior scholar at the Woodrow Wilson 
School of Public and International Affairs 
at Princeton University in New Jersey, and a
winner of the Nobel prize in economics for his 
pioneering work in the psychology of decision- 
making. “It takes an enormous amount of 
practice to change your intuition,” says Kahne- 
man. “Intuition rules decision-making, that is 
human nature and that is how it is going to be.” 
Nonetheless, he says, people can improve their 
critical thinking so that they become better at 
detecting when they might make a mistake. 
They are then in a better position to prevent 
or correct it. 


Instinctive bias 

Researchers have found that some of the most 
effective cognitive tricks include looking at a 
problem from an outsider’s perspective; con- 
sidering the opposite of whatever decision you 
are about to make; and weighing up multiple 
options simultaneously rather than accepting 
or rejecting each one in turn⁴. Such tricks add
up to what Jonathan Baron at the University of 
Pennsylvania in Philadelphia calls 
“actively open-minded thinking” 
— an approach in which people 
intentionally look beyond the 
first conclusions that come to 
mind. He and other researchers 
have found that some people are much better 
at this than others. “It isn’t completely clear 
where these differences come from, but I think 
this kind of result is optimistic as it suggests 
these biases — unlike, say, [interpretation of] 
visual illusions — are not an unalterable part 
of the human condition.”

One clue to the origin of the differences 
comes from mathematics. Peters has found 
that when people with low numeracy skills 
are asked to assess the risks of a potential ter- 
rorist action, they are more likely than high- 
numeracy individuals to over-estimate the 
likelihood of an attack⁵. In addition, she found
that numerate people are better at interpreting
data about real-world scenarios, such as
the performance and quality of hospitals and
health insurance plans⁶.

Peters argues that people who use numbers 
more effectively in decision-making do so 
because they are better at giving numbers 
emotional significance and seeing them as
representing reality in some way — what
is known as ‘affective meaning’. She suggests
that it may be no coincidence that
people with low numeracy skills tend to 
have a high body-mass index and tend 
to be poor at managing their own health. 
The challenge, says Peters, is to find a way 
to structure mathematics education so 
that students grasp the meaning in num- 
bers faster. 

This is what her colleague Paul Slovic at 
Decision Research calls “learning to feel 
the numbers”. He favours teaching chil- 
dren to deal with numbers in a contextual 
way as soon as they start to learn to count. 
For example, teachers should describe the 
number 10 in terms of something tangible 
— say, 10 ice-cream cones — so that chil- 
dren can remember the number in a way 
that relates to the real world. Or they could 
ask children to consider how it makes 
them feel if someone gives them a penny. 
What about two pence, three pence? “Get 
them to think about their feelings in rela- 
tion to numbers and whether their feel- 
ings are logical or not,” says Slovic. 


Offering teenagers a dollar for every day they are not
pregnant could reduce teen pregnancy rates.


Statistical shortfall

Gigerenzer’s goal is to make such ideas an
integral part of education at every level. Much
of his educational work is aimed at adults who
deal with risk in their professional lives. The
Harding Center offers training seminars in
decision-making and understanding uncertainties
to doctors, journalists and other
specialist groups, an activity that has taken
Gigerenzer around the world. His past clients
include about 1,000 German gynaecologists —
one-tenth of all those practising in the country
— and 40 US federal judges. Of some 200
accredited law schools in the United States, he
points out, only one — George Mason University
School of Law in Arlington, Virginia
— regularly teaches statistical thinking. “So
you have an entire society, including judges
and doctors, who are not being prepared for a
modern technological world containing many
kinds of risks,” he says.

Gigerenzer is also trying to persuade education
authorities to integrate the latest findings
on risk perception into school curricula,
starting when children first start school and
continuing right through until they leave. He
is in regular contact with German education
authorities, and is also working with the largest
German health insurance company, AOK,
to find a way to implement a programme on
statistical thinking in schools in the state of
Baden-Württemberg. Health insurers are
interested, he says, because they realize that
the health system does not run effectively,
“partly because patients don’t understand the
evidence”. The idea is to prepare the next generation
so they know what questions to ask.

The key, he says, is for schools to teach
real-world statistical problems — for example,
calculating the chance that someone who
tested positive for HIV actually has the virus,
or comparing the dangers of
riding a motorcycle in different
countries. Primary schools
should help pupils get used to
probabilistic thinking with programmes
such as Kurz-Milcke’s
tinker-cube exercise. “Our goal
is for statistics to be taught not
as a mathematical discipline, but as a problem-solving
discipline,” Gigerenzer says.

Gigerenzer has had some success: several
German statistics textbooks now use examples
from his 2002 book Reckoning With Risk
(Allen Lane). Furthermore, in many German
states it is now compulsory to start teaching
data analysis and probabilities from the first
year of school. The idea is also catching on in
the United States, where the National Council
of Teachers of Mathematics has declared its
commitment to teaching probabilities up to
year 12.

Still, says Gigerenzer, there is no nationwide
programme in any country that systematically
teaches examples in statistics that students can
usefully apply to real-life situations. And even
in schools that have accepted the need for a
comprehensive education in probabilities
and risks, there is often resistance from
teachers who are wedded to the old system
of teaching it. “In most parts of the
world, children are taught the mathematics
of certainty, not the mathematics of
uncertainty,” he says. “Although geometry
and trigonometry are beautiful
systems, they are of little use in life after
school compared with statistical thinking.
The twenty-first century is at least as
risky and uncertain as those before, and
we need to prepare the next generation.”

In the end, both the education
approach and the nudge approach are
likely to have a role. When it comes to
making better judgements, whether it’s
dealing with complex data or with conflicting
emotional states, people — and
societies — need all the help they can get.
“Societally, we can do more with nudging
people along, but individuals and organizations
still want to think more clearly,”
says Max Bazerman, who studies decision-making
at Harvard Business School.
With Sunstein now working within the
administration of US President Barack
Obama, the nudge approach seems to be
gaining political capital; reforming education
is proving more of a struggle.

The problem, says Gigerenzer, is as much
ignorance as resistance to change among educators
and policy-makers. “Often those who don't
understand, don't understand that they don't
understand.” But he is convinced it is worth the
fight to get the message across.
He receives “a stream of letters”
from mathematics teachers who
have used his real-life statistical
examples in their lessons and
found that their students become
much more interested in the subject
because it applies to the world
they see around them. The long-term benefits for
children could be spectacular: a statistical education
that they will be able to draw on throughout
their lives. The eight-year-olds puzzling over their
coloured tinker-cubes in that classroom in Stuttgart
should leave school well equipped to deal
with the uncertainties of the modern world.

Michael Bond is a freelance writer based in
London.
Sean Mackey inflicts pain on people in the hope of learning how to relieve it.
Erik Vance gets on the receiving end. 


Outside neurology and his family, Sean
Mackey doesn’t have many hobbies. 
The one exception is his monstrous 
flat-screen television and large film 
collection. Driving to Stanford, California, on 
the day I am to visit Mackey’s lab for testing, 
I am reminded of a scene from his favourite
movie, The Princess Bride. In the film, the vil- 
lain, Count Rugen, straps the hero Westley into 
a sinister apparatus and confesses a “deep and 
abiding interest in pain”. Then he tortures the 
hero in the name of science. 
It turns out that this is not far 
from what is in store for me. 
Mackey heads the Pain Man- 
agement Center at Stanford 
School of Medicine where, as 
part of his research on ways 
to relieve pain, he routinely 
inflicts it. Widely seen as one of 
the field’s rising stars, Mackey is part of a move- 
ment to upend the way scientists look at pain, 
drawing the focus away from the nerves that 
sense it, towards the brain that processes it. His 
primary tool is functional magnetic resonance 
imaging (fMRI), which can create images of 
the brain responding as the body is hurting.
The trick now — and one focus of Mackey’s 
work — is to understand whether a person can 
consciously change the way the brain processes 
and perceives pain. That’s where I come in. 
The plan is to put me inside the fMRI scanner, 
apply burning heat, and see whether I can train 
myself to regulate my pain. 

As part of his studies, Mackey has found 
himself struggling with a question facing the 
fMRI field as a whole: when is the technique
ready for use outside the lab?
One of his former colleagues 
has started a company that 
plans to offer patients the 
fMRI ‘feedback’ pain-control 
technique that Mackey was 
involved in developing. But 
Mackey has distanced himself 
from the company in these 
early stages, based on what he has observed 
elsewhere. “I’ve seen too many treatments that 
are the next latest and greatest thing out there 
that people get really excited about. Every- 
body gets on board and initially the results are 
fantastic. And then as time goes by we start to 
see that the results are not as good as initially
proposed,” he says. “And then you find out that
it doesn't work at all.”

Walking into his office near the Stanford 
Hospital, Mackey is more reminiscent of a cor- 
porate executive than a brain researcher. He is 
a cheery, focused ball of energy, with a quick 
smile and a firm, reassuring handshake. He 
regularly wears a suit in the lab. He says it fights 
the stereotype that all anaesthesiologists — he 
trained as one — wear “pyjamas” to work. 


Destructive force 

According to Mackey, the suit also tells 
patients that he takes his work seriously. They 
may be recovering from knee surgery, they 
may be wounded veterans or they may have 
a rare neurological condition that can cause 
excruciating full-body pain. According to 
the International Association for the Study of 
Pain, one in five people endures moderate to 
severe chronic pain. “I have seen it take people 
who are otherwise normal and turn their lives 
upside down and absolutely destroy them,” 
says Mackey. Or as his colleague Ian Carroll,
another Stanford anaesthesiologist, puts it: “It’s
like the black hole of the brain. It dominates it
and forces everything to spin around it.”

Sean Mackey is using
imaging to understand p

The experience of pain typically starts in 
receptors near the skin called nociceptors that 
transmit information through axon fibres to 
neurons in the spine, then to the brain. Until 
the 1990s, pain research focused mostly on 
nociceptors as well as neurons near the spinal 
cord. Pain experts would treat a backache, 
say, directly on the back. If they addressed the 
brain, it might have been with opioids, whose 
mechanisms were somewhat mysterious. 

The arrival of the fMRI scanner changed that.
The technique is an indirect measure of neu- 
ral activity in the brain: as a region activates, it 
consumes oxygen, and neurologists use fMRI 
to track fresh oxygenated blood surging in to 
replace the old. “Imaging is now a huge part 
of the field,” says Allan Basbaum, editor of the
journal Pain. Along with behavioural studies, 
imaging has helped to build the view that pain 
involves many brain areas and that chronic pain 
may cause long-term changes to the morphol- 
ogy or function of some of these regions. 

As a technique, fMRI has its share of detractors.
Interpreting the image relies on complicated
analyses that link blood movement to a
given task performed by the subject, and many 
neuroscience studies have come under fire for 
poor data analyses or interpretation. Although 
Mackey is a devoted user of fMRI, he is also 
an occasional critic. He has testified against 
the use of the technique in several court cases 
where defendants wanted to use fMRI images 


to prove they were telling the truth. 

“This is an interesting time in the use of this 
tool,” says Mackey. “You are now seeing that
the application has gotten easier and easier. 
It’s still not quite like ordering a McDonald’s 
Happy Meal, where you put somebody into 
the scanner, press a button, and the brain pic- 
tures come out, but we are probably going to 
get there.” 

It may seem odd to base 
so much of one’s scientific 
life on a single tool and then 
lobby against its application. 
But Mackey has a different 
background from many pain 
researchers. In addition to 
the MD in anaesthesiology, 
he has a PhD in electrical and computer engi- 
neering from the University of Arizona. He 
chose to work in neurology — he calls it an 
uncharted “Wild West” — because he could 
treat patients and flex his engineering muscles.
This training, colleagues say, gives him a
valuable understanding of the biology of pain, 
the complex instrumentation for measuring it 
and the limitations of that technology. 


Personal limits 

Mackey thinks that pain could be a useful 
proving ground for fMRI. Unlike hard-to- 
define cognitive or emotional states — say, 
deception, jealousy or anger — pain can be 
elicited in a controlled way at specific levels, 
is highly repeatable and leads to a common 
response: it hurts. The intensity of the pain 
experienced varies from person to person, but 
can be ranked on a scale. 

The previous day, I had gone to have my 
own pain threshold tested. A friendly doctor 
took me into a small room and strapped to my 
arm a ‘noxious thermal stimulus’, more accurately
a ‘hot metal pad that hurts’. She added a
pepper-based cream that made my skin sensi- 
tive to heat. Then she slowly worked me up to 
a pain level that I ranked as seven out of ten. A
seven is considered the worst pain a person can 
tolerate without moving, and everyone will 
reach their seven at a different level of heat. 

In the fMRI machine, Mackey planned to
use the same burning metal plate to take me 
directly to seven and leave me there. I would 
be practising a technique that grew out of a 
collaboration between researchers at Stanford 
and Christopher deCharms, 
a visiting researcher from the 
Massachusetts Institute of 
Technology (MIT) in Cam- 
bridge. The team showed 
people a changing image — a 
line graph or a picture of a fire
— representing the real-time 
fMRI signal in their anterior 
cingulate cortex, a region commonly studied in 
relation to pain. They showed that people could 
learn to manipulate the fMRI signal and their
perception of pain intensity through visualization
exercises, such as ‘turning down’ pain like
a radio dial. The team reported in Proceedings 
of the National Academy of Sciences (PNAS) 
that patients with chronic pain reported on 
average a 64% reduction in pain on one scale.
The study showed that fMRI could be used not 
only as a diagnostic, but as the means to ther- 
apy itself, and that people can exert conscious 
control over specific brain regions much as it is 
known that some people can consciously alter 
their heart rate. 

In some of his other work, Mackey’s labora- 
tory has used fMRI to explore these connec- 
tions between pain processing and cognitive 
processes. Fear of pain, for example, can 
increase the pain itself, and Mackey’s group 
studied some of the brain regions involved 
in this anticipation. In another study he
showed that watching someone else in pain 
activates brain areas that are fairly distinct 
from those active during one’s own pain. And 
in unpublished work he has found that roman- 
tic love can lessen the experience of pain. 
Mackey says these connections demonstrate
how strong an influence conscious thought
may have over pain processing.

Studies suggest that people can learn to control pain using feedback to show them their brain activity.

But can this conscious control be put to 
use? Inside the coffin-sized tube of the fMRI
machine, spasms in my back from its powerful 
magnet distract from the burning plate strapped 
again to my arm. On a screen above, I can see 
a squiggly line that represents the activity in a 
part of my anterior cingulate cortex. Mackey 
asks that I envision the heat as alternately sear- 
ing and soothing. The aim is to master control 
of the line so that it (and thus my pain) goes up 
and down. As I switch between these visions the 
line on the screen twitches up and down. 

It is surprisingly difficult. Willpower and 
meditation have little effect, and after two 
hours it is increasingly hard to make the stub- 
born little line move at all. The PNAS study 
suggests that it would probably take several 
sessions before I could be deft enough to 
reduce other pains in my body. 


Commercial approach 

DeCharms, first author of the PNAS paper, 
started a company called Omneuron in Menlo 
Park, California, to offer real-time fMRI ses- 
sions as a form of therapy. The company has 
already garnered a fair amount of attention 
and is funded by the US National Institutes of 
Health (NIH) in Bethesda, Maryland, to con- 
duct a phase II clinical trial in people with pain 
conditions such as neuralgia, fibromyalgia or 
migraines. As a control, some participants 
will see feedback from a previous partici- 
pant’s brain. Omneuron is also investigating 
feedback fMRI to help addicts combat their 
cravings. DeCharms said in a short interview 
that feedback fMRI may someday be a valu- 
able tool to ease chronic pain. Mackey has lit- 
tle to say directly about Omneuron, beyond 
cautiously wishing them good fortune. He 
adds that Stanford currently has no connec- 
tions with the company’s trials. 

“There is something different about the 
balance of goals that a company has versus a 
nominally disinterested science project,” says 
John Gabrieli, deCharms’ 
former supervisor at Stanford 
and now at MIT. “I think that
is partially why Mackey and 
deCharms parted ways. I 
think they couldn't find that 
balance between them.” 

Gary Glover, who directs 
Stanford’s radiological-sci- 
ences lab, was a co-author on 
the PNAS paper and is more openly sceptical 
of attempts to commercialize feedback fMRI. 
He points out that the PNAS study determined 
that the participants benefited on average. This 
does not necessarily mean that the technique 
would work for any particular individual,
though. And customizing the feedback fMRI
is generally more difficult than customizing
the dose of a drug, because each person might
be helped best by targeting a different brain
region with a different mental exercise.

Nature's reporter prepares for pain.

Glover also worries that with such a new
technique, no one has any idea if it carries 
side effects: perhaps after a while the pain will 
worsen, or perhaps the therapy could stop 
patients feeling useful pain in other situa- 
tions. “There are ethical questions when you 
are in the middle of someone's cognition,” 
says Glover. Linda Porter, who oversees the 
NIH programme funding Omneuron’s trial, 
says the obstacles are surmountable and that, 
compared with drug trials, the minimal risk of 
side effects is one of the appealing aspects of 
feedback fMRI.

Mackey is split on the issue of fMRI therapy.
As a clinician he thinks that research should be
guided by patients’ needs and that new tech- 
nology should be released as soon as possible. 
As a researcher, he stresses that it is too early 
for clinical use. Pain therapies 
are highly subject to placebo 
effects, which often boost initial
results and then wither
away during later trials. The 
therapy is dependent on cli- 
nicians’ ability to locate the 
correct part of the brain to 
show a patient. Much of the 
research so far has targeted 
single regions of the brain, yet it is possible that 
multiple regions are involved.

Mackey admits that he doesn’t know what is
required to convince him fMRI therapy is ready
for patients. But he is working to convince
himself. His latest study is a feedback-fMRI
experiment aimed not at specific brain regions, 
but at the connections between them. He is also 
experimenting with new ways to visualize the 
pain and, of course, ways to apply it. 

It is easy to see why some are so excited about
the possibilities of fMRI therapy. It could allow 
the vast body of research into brain imaging to 
have a direct outlet that benefits patients. If it 
works, it would be the first non-invasive, drug- 
free method of directly treating specific regions 
of the brain, with potential applications beyond 
pain relief, such as addiction and depression. 

But it is not without its shortcomings. After 
two hours in the fMRI machine, I am stiff and
woozy. It is likely I have more pain than when 
I went in. Mackey assures me that regulat- 
ing pain is in no way attached to intelligence, 
which seems like a polite way of saying I did 
not do very well. Afterwards, we sit on a shady 
bench and talk about the potential of the mind 
to regulate itself. 

Still a bundle of energy, Mackey enthuses 
about the future of pain treatment in general, 
calling current technology “the dark ages”. 
He has no doubt that self-directed treatment 
such as feedback fMRI is the future, perhaps 
done alongside drug prescriptions and physi- 
cal therapy. 

He adds that he is looking for volunteers for 
his latest round of experiments. 

I stay quiet, and rub my arm a little.

Erik Vance is a freelance writer based in
Berkeley, California.


Brainstem tests not
adequate to diagnose 
death in organ donors 


SIR — Your Editorial's call for 
serious discussion of laws 
governing the diagnosis of death 
(Nature 461, 570; 2009) is most 
welcome, if long overdue. It is 
imperative that those involved
in the practice of transplantation
know the status of organ donors. 
And it is high time that those 
offering their organs for transplant 
— “after my death” are the words 
on NHS Organ Donor Register 
application forms — are clearly 
and fairly informed about the 
state they will be in if their offer is 
taken up. 

In the United Kingdom, ‘death’
is still being certified for that 
purpose on the basis of purely 
bedside tests of some brainstem 
functions. There has never been 
sound scientific support for this 
standard, and it was declared 
“clinically dangerous” in a report
by the US President's Council on
Bioethics last year (see
go.nature.com/58y3DP).

As you assert, few things are 
as sensitive as death. Its certain 
diagnosis is surely of such 
importance that it should be 
addressed without concern for 
dependent interests. 

David W. Evans Queens’ College, 
Cambridge CB3 9ET, UK 
e-mail: dwevansmd@doctors.org.uk 


Readers are welcome to join 
the online discussion about this 
Editorial at Nature Network, 
go.nature.com/WjUiku. 


Funding on ‘Sheriff of 
Nottingham’ model 
could cut productivity 


SIR — With a view to propelling 
Canada to the top tier of 
international rankings, the 
presidents of five large Canadian 
universities have proposed a 
radical restructuring of post- 
secondary education. This 
includes the concentration 
of research in a few elite
universities, with the others 
focusing on undergraduate 
education. Predictably, the 
country’s other universities 
are outraged (see
go.nature.com/6MMcsl).

The ‘big five’ universities 
(McGill, Montréal, British 
Columbia, Alberta and Toronto) 
already receive about 40% of 
the country’s federal research 
funding. The ‘Sheriff of 
Nottingham’ model — which, 
like Robin Hood's adversary, 
robs the poor to pay the rich — 
relies on the premise that 
productivity increases faster 
than dollars invested. If this 
concept is valid, there should 
be more bang for the buck 
when funding is concentrated 
in a few institutions, rather than 
spread broadly. 

Research productivity can be 
gauged by number of published 
papers and by the h-index, 
which measures highly cited 
publications, although these 
indicators have their flaws. 
I calculated both for all the
researchers at 27 Canadian
universities using data from the
ISI Web of Science, based on
papers published from 2005 to
2009. I extracted 2009 research
funding per university from the 
three federal granting councils’ 
websites. 
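
For readers unfamiliar with the h-index mentioned above, here is a minimal sketch in Python of how it is usually computed: a researcher has index h if h of their papers have each been cited at least h times. The function name and the citation counts are invented for illustration; they are not the data behind this letter.

```python
def h_index(citations):
    """Return the largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)  # most-cited paper first
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Hypothetical citation counts for one researcher's papers.
example = [25, 8, 5, 3, 3, 1, 0]
print(h_index(example))
```

Because the index counts only papers that clear the threshold, one very highly cited paper raises it by no more than an ordinary well-cited one, which is one of the flaws the letter alludes to.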

According to these two indices, 
total research productivity and its 
impact both relate very strongly 
to the total funding received per 
university, but both relationships 
are significantly decelerating. 
(Details of these calculations are 
available from the author.) 
Expressed per dollar invested, 
research productivity and impact 
both decrease as funding per 
university increases. Bang for 
the research buck is better in 
smaller institutions. 

Concentrating funding in large 
universities would probably 
increase their productivity, and 
perhaps their bragging rights. 
However, this would come at 
the price of reduced total 
research productivity summed 
over all Canadian universities. 


The same conclusion would 
probably apply to any system
in which finite resources for
research must be divided among 
a pool of contenders. 
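
The step from a 'decelerating' relationship to lower output per dollar, and to a lower national total under concentration, can be illustrated with a small sketch. It assumes, purely for illustration, that output follows a power law of funding, output = a * funding**b with 0 < b < 1. The letter does not state the fitted form, and the values of a and b and the budgets below are invented.

```python
def output(funding, a=1.0, b=0.7):
    """Hypothetical decelerating relationship: output = a * funding**b, with 0 < b < 1."""
    return a * funding ** b


def output_per_dollar(funding, a=1.0, b=0.7):
    """Output divided by funding; equals a * funding**(b - 1), which falls as funding rises."""
    return output(funding, a, b) / funding


# Invented budgets (arbitrary units); both allocations total 100.
spread = [25, 25, 25, 25]       # funding spread evenly over four institutions
concentrated = [85, 5, 5, 5]    # most funding concentrated in one institution

total_spread = sum(output(f) for f in spread)
total_concentrated = sum(output(f) for f in concentrated)

# The smaller budget yields more output per unit of funding.
print(output_per_dollar(5), output_per_dollar(85))
# The even split yields the larger summed output.
print(total_spread, total_concentrated)
```

Under this assumed form the favoured institution does produce more, but the sum over all institutions falls, which is the trade-off the letter describes. Any other concave relationship would give the same qualitative result.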

David Currie Biology Department, 
University of Ottawa, 30 Marie Curie 
Private, Ottawa, Ontario K1N 6N5, 
Canada 

e-mail: david.currie@uottawa.ca 


A playful side to 
twelfth-century 
mathematics 


SIR — As editors of the book 
Lilavati's Daughters: The Women 
Scientists of India, reviewed by 
Asha Gopinathan (Nature 460, 
1082; 2009), we would like to 
elaborate on the background to 
its title. 

Lilavati was a mathematical 
treatise of the twelfth century, 
composed by the mathematician 
and astronomer Bhaskaracharya 
(1114-85) — also known as 
Bhaskara II — who was a teacher
of repute and author of several 
other texts. The name Lilavati, 
which literally means ‘playful’, 
is a surprising title for an early 
scientific book. Some of the 
mathematical problems posed 
in the book are in verse form, 
and are addressed to a girl, the 
eponymous Lilavati. 

However, there is little real 
evidence concerning Lilavati's 
historicity. Tradition holds that 
she was Bhaskaracharya's 
daughter and that he wrote the 
treatise to console her after an 
accident that left her unable to 
marry. But this could be a later 
interpolation, as the idea was 
first mentioned in a Persian 
commentary. An alternative 
view has it that Lilavati was 
married at an inauspicious 
time and was widowed shortly 
afterwards. 

Other sources have implied 
that Lilavati was Bhaskaracharya's 
wife, or even one of his students 
— raising the possibility that 
women in parts of the Indian 
subcontinent could have 
participated in higher education 
as early as eight centuries ago.
However, given that Bhaskara 
was a poet and pedagogue, it is 
also possible that he chose to 
address his mathematical 
problems to a doe-eyed girl 
simply as a whimsical and 
charming literary device. 
Ram Ramaswamy Jawaharlal Nehru 
University, New Delhi 110 067, India 
e-mail: r.ramaswamy@mail.jnu.ac.in 
Rohini Godbole Indian Institute of 
Science, Bangalore 560 012, India 


Weighing up NICE 
against private 
health-care schemes 


SIR — There is little question 
that the US health-care system 
requires reform. But it is 
debatable whether abdicating 
personal health-care decisions 
to an organization like the UK 
National Institute for Health and 
Clinical Excellence (NICE), as you 
recommend in your Editorials 
(Nature 461, 315-316 and 847; 
2009), is the best approach to 
reform. 

NICE decides “which of the 
available medical options is 
most effective at treating any 
given condition”. This is beyond 
reproach and deserving of the 
international accolades the 
organization has received. 
However, NICE also decides 
“which [medical option] is worth 
the money”. To many Americans, 
this is objectionable. It is a 
subjective assessment intimately 
tied to the individual and shouldn't 
be in the hands of a committee. 

There are myriad reform 
schemes being debated that still 
preserve an individual's control 
over health-care decisions. A 
poorly implemented private care 
scheme can always be reformed, 
but choosing government control 
means there is no turning back. 
Todd A. Gibson University of Colorado 
Denver, Box 6511, MS 8303, Aurora, 
Colorado 80045, USA 
e-mail: todd.gibson@ucdenver.edu 


Contributions may be sent to 
correspondence@nature.com.


Global Darwin: Eastern enchantment


People from Egypt to Japan used Darwin's ideas to reinvent and reignite their core philosophies and religions, 
says Marwa Elshakry in the first of four weekly pieces on how evolution was received around the world.


No other nineteenth-century
scientist possessed Charles Darwin's
global renown. Between
the appearance of On the Origin of 
Species in 1859 and The Descent of 
Man, and Selection in Relation to Sex 
some 12 years later, his works were 
discussed in scores of languages. Darwin noted 
in his autobiography, published in 1887, that the 
theory was debated as far afield as Japan, and 
added with some surprise that he’d even seen an
essay on the Origin in Hebrew showing that “the 
theory is contained in the Old Testament!” 

His worldwide fame was, in part, thanks to 
technology. The first telegraphic cables were 
laid across the Atlantic Ocean floor around the 
time the Origin was published, and the next 
two decades saw Europe connected in the same 
way to India, China and Australasia. Mean- 
while, mechanical advances in paper making 
and printing helped to move ideas across the 
globe at record speeds. 

Yet the main reason for the worldwide success 
of Darwin's ideas was the ease with which they 
were assimilated into local traditions of thought 
— as the example of the Jewish attempt to 
reconcile science with scripture hints. Although 
Darwin himself may have found such recon- 
ciliation surprising, it was certainly not as 
unusual as he might have imagined. Scholars 
from Calcutta to Tokyo and Beijing constructed 
their own lineage for the theory of evolution by 
natural selection, tracing it to older and more 
familiar schools of thought and claiming owner- 
ship of what they saw as the precursors to these 
ideas. Although some, particularly in Europe, 
saw Darwin as a weapon beating down religious
beliefs, around the world he was as much a force 
for religious resurgence and revivification as for 
religious scepticism. Even nineteenth-century 
Muslim thinkers reconciled Darwinian ideas 
with their own past religious and philosophical 
texts, which may seem ironic, given the rise of
Muslim creationists today. 


Cosmic order 
Take as one example the work of Chinese 
scholar Yan Fu. In the late 1890s, Yan pub- 
lished a popular translation of Thomas 
Huxley’s Evolution and Ethics in which he 
reinterpreted both Huxley and Darwin in the 
light of Confucian ethical debates. 

Huxley, one of Darwin’s most vocal 
supporters, had argued that humans
acted against the natural order of 
things when putting the interests 
of others above themselves. But for 
Yan, this gloomy view of nature ran 
counter to what he understood to be 
Darwin's — and Confucius’s — belief 
in the perfectibility of the cosmic order. Echo- 
ing older Confucian ethical debates while 
drawing on his own reading of Darwin and 
other Victorian naturalists, Yan argued that 
selfishness and selflessness were part of the 
natural order, and that each has its place in 
the journey towards an ideal state: the key is 
to achieve the right balance between the two. 
This was how Darwin effectively gave Yan, and 
many of Yan’s readers, new licence to endorse 
one of Confucianism’s ethical prescriptions.

Darwin's ideas were similarly used by late- 
nineteenth-century Bengali intelligentsia to 
support long-standing Hindu cosmological 
beliefs. Some of these thinkers wrote of how 
modern theories of positivism (the idea that 
true knowledge is that based on verifiable 
sensory experience) and evolutionism had ech- 
oes in Hindu theories of creation. 

For example, Satish Mukherjee, a leading 
member of the Indian Positivist Society, saw 
Samkhya, one of the oldest schools of Hindu 
philosophy, as a precursor to the modern 
view of evolution. Under Samkhya, the world 
unfolds as a result of a continual cycle between
creation and dissolution: 
consciousness, self or 
spirit becomes realized in 
matter and then separated 
from it, and so on. These 
cycles are seen to account 
for the creation of species 
as well as for the evolution of different stages 
of the Universe. For Mukherjee, as for many 
later Indian thinkers, Samkhya was therefore 
the theory of evolution applied to the entire 
cosmos. 

Muslim readers found their heritage in 
Darwin's theory too. Supporters and crit- 
ics pointed out that Muslim philosophers 
had long referred to the idea that species or 
‘kinds’, as the Arabic term anwa‘ suggests,
could change over time. For this reason the 
great classics of early Muslim philosophy and 
cosmology were almost always cited whenever 
Darwin was discussed in Arabic, Farsi or Urdu. 
Muslim writings from the tenth and eleventh 
centuries referred to a hierarchy of beings, 
from minerals to flora and fauna, and even 
argued that apes were lower forms of humans 
— more evidence for nineteenth-century Muslims
that Darwin's theory was ‘nothing new’.


Empire and evolution 

One of the driving forces behind many of 
these scholars’ work was a desire to push back 
against the forces of Western imperialism. At 
the height of European imperial power, claims 
about white superiority were widespread. In 
response, defenders of non-Western faiths 
drew attention to the greater rationality of 
their creeds to defend themselves against 
Western charges of backwardness and super- 
stition. Many were keen to show that their 
traditions, unlike those of Western Europe, 
accepted, reinforced or had even anticipated 
the findings of modern science. By embracing 
Darwin's ideas, they emphasized that Christi- 
anity alone was in conflict with science. 

Muhammad Abduh, the Grand Mufti of 
Egypt, for instance, was worried about the 
inroads that missionaries had made into the 
educational system of the Muslim Ottoman 
lands. He was also tired of critics pointing to 
Islam's supposed inability to accommodate 
modern pedagogy and science. In Science and 
Civilization in Christianity and Islam (1902), 
Abduh argued that, in contrast to Christianity, 
Islam was free of the con- 
flict with science that had 
so violently plagued Chris- 
tian civilization in Europe. 
To stress this difference, he 
repeatedly wove references 
to Darwin and evolution 
into lectures on the exegesis of the Koran. 

Although many used Darwin to highlight 
the glory of their founding civilizations, they 
also co-opted his theory to explain their falling 
behind the Western world in modern times. It 
was seen as a way to explain both the rise of the 
West's technological and imperial superiority 
in the present, and the path to success for the 
rest of the world in the future. 

At the height of the scramble for Africa in 
1899, for instance, the Egyptian intellectual 
and women’s-rights advocate Qasim Amin 
warned that “Western civilization, speeded 
by steam and electricity, is advancing and 
has expanded from its origins to all parts of
the earth”. The weak, he warned, would be 
unable to survive the onslaught. For civil serv- 
ant Amin, this meant that social reform was 
needed. ‘Self-strengthening’ state reformers 
in Korea and Indian nationalists in the early 
twentieth century felt much the same way, and 
they too turned to evolution’s advocates for 
instruction while pushing key governmental 
reforms. Of course, the battle cry of intellectu- 
als was not always heeded. 

In promoting political ‘evolution’, most of
Darwin's proponents outside Europe sub- 
scribed not to revolution, but to change of a 
very gradual sort, mimicking the step-by-step 
slow change of natural selection. 

Hiroyuki Kato, an instructor of law at the 
Tokyo Imperial University, used Darwin's 
theory to defend Japan's imperial rule at the 
beginning of the twentieth century. At that 
time, a rise of democratic movements was 
challenging the power of the Emperor Meiji. 
Kato, who also gave weekly lectures to the 
Emperor on constitutional and international 
law, supported a strongly centralized imperial 
line of rule. He found in Darwinism a new lan- 
guage in which to dress his arguments and a 
scientific explanation for why radical change 
wasn’t the answer to Japan's problems.

Kato reinterpreted Darwin’s ‘struggle for life’
as a slow, steady ‘struggle for ethics’. The ethic
he favoured could be counted as part of the 
samurai principle of self-sacrifice, which in 
this case he took to mean absolute allegiance 
to the Emperor above all other commitments. 
Just as through death the samurai was said 
to become the perfect winner, so the ultimate
victor in the struggle for ethics was the
martyr dying for the sake of something bigger.
This demonstrates another characteristic 
common to non-European responses to Dar- 
winism: the real question most saw lurking 
behind the theory of evolution was whether one 
could draw a moral code from nature. For Kato 
as for so many others, mere survival was not 
enough to comprise a true ethics — evolution- 
ary or otherwise. There had to be something 
beyond life to give life itself purpose. As Mus- 
lim reformer Muhammad Iqbal later put it, the 
main problem with Darwin's view of evolution 
was that it gave death ‘no constructive meaning’.
Perhaps for this reason, many attached
their own meanings and linked Darwin to long- 
standing ethical systems of their own. 


Paragon of scepticism? 

If the ease with which Darwin’s ideas were 
assimilated into local traditions of thought 
is little known today, it is because much of 
the discussion about Darwin in the West has 
focused on the supposed clash between his 
theory of evolution and Christianity. Cer- 
tainly, ever since 1859, Darwin's name has been 
invoked by supporters of the forces of science 
in their battle against religion, and the image 
of Darwin as a paragon of religious scepticism 
has helped him to become an enduring icon of 
the modern sciences. 

Darwin's theory did indeed help to sharpen 
the sense of a boundary between ideas of 
science and of religious faith. For disciples 
such as Huxley, Darwin's empirical approach 
offered a way to distinguish knowledge from 
belief, or fact from fiction. The Church of Eng- 
land, along with many other establishments, 
fought back: bishops preached that to believe
Darwin was to risk endangering one’s soul.

© 2009 Macmillan Publishers Limited. All rights reserved

Yet in truth, things were never this simple. 
Darwin was indefinite and at times incon- 
sistent on the question of religion in his own 
writings. He famously left the ultimate origin of 
species ambiguous in the last line of the Origin 
— speaking of the power of life as ‘originally 
breathed’ into one or several forms, deploying a 
key Christian metaphor for creation — and he 
often conveyed himself as an agnostic in his let- 
ters. Not all Christians recoiled from Darwin's 
ideas; some Protestants and Catholics believed 
that they too could reconcile their doctrines 
with his theory and were spurred to revisit 
their own interpretation of scripture. 

Then, as now, Darwin meant different things 
to different people. Globally, he was not so much 
a revolutionary or a scourge of faiths, as he was
a revivifier of traditions. He straddled worlds 
between the moderns and the ancients, giving a 
new lease of life to ancient philosophers, ethical 
debates and even dynastic loyalties. 

In an age in which advocates of intelligent 
design battle to have evolution removed from 
classrooms, we would do well to recall how 
Darwin once captured and captivated the 
world — not by ridding it of the forces of 
enchantment, faith or even God, but by revi- 
talizing traditions of belief and re-enchanting 
so many.

Marwa Elshakry is associate professor of history
at Columbia University, 611 Fayerweather Hall, 
New York, New York 10027, USA, and is the 
author of the forthcoming Reading Darwin in the 
Middle East (University of Chicago Press). 

See Editorial, page 1173. Further reading 
accompanies this article online. For more on 
Darwin see www.nature.com/darwin


The day the Internet age began


Forty years ago today the first message was sent between computers on the ARPANET. Vinton G. Cerf, who was 
a principal programmer on the project, reflects on how our online world was shaped by its innovative origins. 


On 29 October 1969, Charley Kline, a
student in the Network Measurement
Center at the University of California,
Los Angeles (UCLA), sent the first ever mes- 
sage from one computer to another on the 
ARPANET. The other computer was in the 
Stanford Research Institute, 500 kilometres 
to the north. Kline typed the ‘l’ and the ‘o’ of
the word ‘login’ before one of the 
machines crashed. This was 
the inauspicious start to the 
ARPANET project, which led, 
ultimately, to the Internet. 

The ARPANET was a net- 
work of dedicated telephone 
circuits connecting refriger- 
ator-sized Interface Message 
Processors (IMPs). The IMPs 
formed a network that, itself, 
interconnected ‘host’ comput- 
ers on which programs were 
implemented. 

It all began in the early 1960s, 
with explorations of a radical communica- 
tion method that came to be known as ‘packet 
switching’. Packets are like electronic postcards
with ‘to’ and ‘from’ addresses and a little text.
They share communication paths the way 
postcards share space in a postman’s van. This 
turns out to be more efficient for many short- 
lived interactions than dedicating capacity as 
in the telephone network with ‘circuit
switching’.
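
The postcard analogy can be made concrete with a toy sketch in Python. The Packet class, the host names, the four-character payload and the round-robin sharing of the link are all illustrative assumptions; they are not the ARPANET's actual packet format or scheduling.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Packet:
    """A toy 'electronic postcard': two addresses and a little text."""
    src: str
    dst: str
    text: str


def packetize(src, dst, message, size=4):
    """Split a message into packets carrying at most `size` characters each."""
    return [Packet(src, dst, message[i:i + size]) for i in range(0, len(message), size)]


def share_link(*flows):
    """Interleave packets from several senders over one link, round-robin."""
    queues = [deque(flow) for flow in flows]
    on_the_wire = []
    while any(queues):  # an empty deque is falsy
        for q in queues:
            if q:
                on_the_wire.append(q.popleft())
    return on_the_wire


def reassemble(packets, src, dst):
    """Collect the text of one conversation from the shared stream, in arrival order."""
    return "".join(p.text for p in packets if p.src == src and p.dst == dst)


# Hypothetical hosts sharing a single path.
link = share_link(
    packetize("ucla", "sri", "login"),
    packetize("ucsb", "utah", "hello world"),
)
print(reassemble(link, "ucla", "sri"))
print(reassemble(link, "ucsb", "utah"))
```

Neither conversation reserves the link: the short message finishes after two packets and the path is immediately free for the longer one, which is the efficiency gain over dedicating a circuit. This toy link keeps packets in order, something a real packet network does not guarantee.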

In the mid 1960s, Robert Taylor, director of 
the Information Processing Techniques Office 
at the US defence department’s Advanced 
Research Projects Agency (then ARPA, and now 
DARPA) launched a practical packet-switch- 
ing experiment largely to enable researchers 
at different computer-science departments 
to share software and 
resources. Led by Law- 
rence Roberts, the 
project yielded spectac- 
ular results: technologi- 
cally, intellectually and 
philosophically. 

On 2 September 
1969, the first packet-switching node — or IMP 
— was installed at the Network Measurement 
Center, as the portal into the ARPANET. At 
the time, I was a graduate student at the centre, 
along with Steve Crocker and Jonathan Postel, 
under director Leonard Kleinrock. Crocker led 
an aggregation of graduate students — called 
the Network Working Group — from a dozen 
institutions, who developed the programs used 
to exchange information between the comput- 
ers connected by the ARPANET. Postel became 
the ‘numbers czar’ and editor of the Request for 
Comments series that, to this day, publishes all
Internet standards. And I became the centre’s
principal programmer.

The ARPANET logbook page recording the first
computer-to-computer transmission.

Then in October came Kline’s first test of 
what we were all working on. Happily, it was 
followed by many much more successful 
ones. 


Freedom to innovate 
Kleinrock focused much of his energy on 
mathematical modelling, and gave his gradu- 
ate students remarkable freedom. We had free 
rein to pursue our ideas with only occasional,
very strong, critical intervention by Roberts. 
The feeling of empowerment was reinforced 
by the convening of a 
graduate-student con- 
ference parallel to the 
principal investigators’ 
meetings. 

Crocker, Postel and 
I met scores of other 
students working on 
the project elsewhere. This led to institu- 
tional cooperation and friendships that have 
lasted decades. It still sends a frisson down my 
spine to recall tramping across an open field 
in Allerton House, Illinois, with Crocker and 
Jeff Rulifson, then a key player in the oN-Line
System (NLS) project that was a micro-World
Wide Web of its time. We were discussing the 
premise of computer networking. We con- 
fronted the forest of ideas and confusion of 
possible paths as if we were about to launch 
into outer space.

Out of the Network Working Group effort
emerged the ARPANET host-to-host protocols,
including standard mechanisms for accessing
computers on the other side of the network
and transferring files between them, as well
as support for electronic mail. E-mail was
an unplanned development that 
became extremely popular for 
keeping distributed researchers 
in close touch. 


Crash testing 
By December 1969 the first four 
IMP nodes of the ARPANET 
were up. The other three were 
at Stanford Research Institute (later SRI Inter- 
national) in Menlo Park, California; the Uni- 
versity of California, Santa Barbara; and the 
University of Utah in Salt Lake City. Our task 
at UCLA was to measure performance and to 
compare it with Kleinrock’s models. I wrote 
the software to inject controlled traffic into 
the network and to capture the associated per- 
formance data including end-to-end delays, 
delivery rates, effects of congestion and ability 
to route around failed paths in the network. 
Around the turn of the year Robert Kahn 
and David Walden from Bolt Beranek and 
Newman (BBN) — the company that designed 
the ARPANET IMPs — came to UCLA to 
carry out performance tests on the network. 
Kahn had aired some theories about network 
failure modes that his colleagues thought 
were unlikely in practice. He and I ran a series
of tests to explore these cases. Kahn would 
come up with a scenario and I would write the 
programs for the UCLA host to implement 
it. We managed to crash the network many 
times, showing where the internal control 
systems of the initial ARPANET implemen- 
tation needed work. I remember thinking we 
should paint a sort of network symbol on the 
side of the UCLA computer for each time we 
crashed the network, as fighter pilots used 
to do with ‘kills’ in the Second World War.
The extraordinary partnership forged
between us has lasted for four decades and has 
included work on digital libraries, long-term 
storage and retrieval of digital content and 
‘knowledge robots’ that can move around in 
the Internet performing useful functions. 

Kahn organized a public demonstration 
of the ARPANET at the first International 
Conference on Computer Communication, 
held in Washington DC in October 1972. 
An IMP was installed in the ballroom of the 
Washington Hilton hotel, and the event went 
extraordinarily well. This display of the power 
of packet switching and computer network- 
ing, more generally, led ARPA and the Xerox 
Palo Alto Research Center to pursue the 
development of additional networks using 
the same principle, and spurred the growth of 
computer-to-computer communications. 

Roberts had begun exploring the use of 
packet satellite communication to link net- 
work nodes via hand-held radio terminals, 
when terrestrial lines were unavailable or too 
expensive. Kahn began to consider mobile 
radio networks as another medium. In early 
1973 the ARPANET joined two other ARPA 
projects: the Atlantic Packet Satellite Network 
(SATNET) and the Packet Radio Network 
(PRNET). By May 1973, Robert Metcalfe (for- 
merly part of the Network Working Group) 
and David Boggs at Xerox had invented the 
Ethernet for local area network communica- 
tions based, in part, on Metcalfe’s visit to the 
University of Hawaii where he learned of 
another ARPA project, the ALOHAnet. 


The Internetting problem 

At around the same time Kahn described to me 
his vision of open network interconnection. 
He foresaw the need to link together packet- 
switched networks with different designs 
so that any computers could communicate 
freely, no matter what the communication 
path. Kahn started a research programme at 
ARPA focused on this ‘Internetting’ problem. 
Over about six months, Kahn and I met many 
times and by August 1973, we had developed 
the basic concepts of what became the trans- 
mission control protocol (TCP) and the basic 
architecture of the Internet. We presented a 
paper at a meeting at the University of Sus- 
sex in Brighton, UK, in September 1973 
(published the following year as V. Cerf and 
R. Kahn IEEE Trans. Commun. 22, 637-648; 
1974) describing how to interconnect an 
arbitrarily large number of packet-switched 
networks and attached computers. 

For Vinton Cerf, the Internet knows no bounds.

With support from ARPA, implementation
of the new protocol began in 1975 at defence
technology contractor BBN, Stanford and University
College London. In November 1977,
we carried out a three-network test — a major
milestone. A ‘packet radio van’, built by Stanford
Research Institute, cruised the San Francisco 
Bayshore highway, radiating packets that were 
transferred through a gateway (a computer that 
links networks) to the ARPANET and relayed 
to University College London. There they 
were relayed through another gateway to the 
Atlantic Packet Satellite Network back across 
the Atlantic to the United States where they 
re-entered the ARPANET through another 
gateway and were then transported to the 
University of Southern 
California computers in 
Los Angeles. The distance 
between the packet radio 
van and Los Angeles was 
about 500 kilometres but 
the packets actually trav- 
elled 160,000 kilometres 
over two satellite hops and twice across the 
United States. 

By 1978, these protocols, and other 
ARPANET application protocols (for e-mail, 
file transfer and remote terminal access), had 
been prepared to operate in a multi-network 
environment — the Internet. Work proceeded 
on implementations of these protocols for the 
many operating systems of machines then on 
the ARPANET. During 1982, a substantial 
effort was undertaken to implement the TCP/ 
IP protocol suite on all the machines on all the 
networks. 

In 1983 the ARPANET was split into a mili- 
tary network (MILNET) with nodes located 
mainly on military sites or other protected 
locations and the remaining ARPANET nodes 
located at universities, non-profit organiza- 
tions, research centres and some government 
sites. By 1986, the US National Science Foun- 
dation had launched its NSFNET project, using 
the TCP/IP protocols; and other efforts were 
beginning to emerge overseas. The ARPANET 
was formally decommissioned in 1990. Hosts
were moved to sites on the NSFNET (or its
regional offspring). Even the NSFNET was 
retired in 1995 after sufficient commercial 
Internet service was available to serve the aca- 
demic community in the United States. 

Despite the coming and going of many con- 
stituents, the Internet persists because it is an 
architecture for interoperability rather than a 
fixed set of networks. 


Space next 

The ARPANET project demonstrated the 
remarkable flexibility and utility of packet 
switching for computer communications. The 
subsequent existence of multiple networks — 
SATNET, PRNET, Ethernet — highlighted 
the versatility of the technique on different 
communications media. During the course 
of the ARPANET development, the concept 
of ‘layering’ evolved. This is the separating of 
various communication functions into dis- 
tinguishable strata to simplify structure and 
implementation. It is layering that has permit- 
ted the Internet to absorb and use every new 
communication technology developed along 
the way. 
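
The idea of layering described above can be illustrated with a small sketch. It is only an illustration under stated assumptions: the layer names and the byte-string headers are hypothetical and do not reproduce any real ARPANET or TCP/IP format. It shows that the upper layers stay unchanged while the lowest layer (the communication medium) is swapped.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Layer:
    """One stratum of a toy protocol stack (hypothetical header format)."""
    name: str
    header: bytes

    def wrap(self, payload: bytes) -> bytes:
        # Prepend this layer's header to whatever the layer above produced.
        return self.header + payload

    def unwrap(self, frame: bytes) -> bytes:
        # Strip this layer's header and hand the rest to the layer above.
        if not frame.startswith(self.header):
            raise ValueError(f"{self.name}: unexpected header")
        return frame[len(self.header):]


def send(stack: List[Layer], message: bytes) -> bytes:
    """Pass a message down the stack; the stack is listed top layer first."""
    frame = message
    for layer in stack:
        frame = layer.wrap(frame)
    return frame


def receive(stack: List[Layer], frame: bytes) -> bytes:
    """Pass a frame up the stack, starting from the bottom layer."""
    for layer in reversed(stack):
        frame = layer.unwrap(frame)
    return frame


# The upper layers are shared; only the bottom (medium) layer differs.
upper = [
    Layer("application", b"APP|"),
    Layer("transport", b"TRN|"),
    Layer("internet", b"INT|"),
]
radio_stack = upper + [Layer("packet-radio link", b"RADIO|")]
satellite_stack = upper + [Layer("satellite link", b"SAT|")]

radio_frame = send(radio_stack, b"hello")
satellite_frame = send(satellite_stack, b"hello")
```

Because each layer touches only its own header, a new medium can be added as one more bottom layer without changing the code that builds or reads the layers above it.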

Neither the ARPANET nor the Internet and 
its constituent networks were purpose-built 
for particular applications. This design phi- 
losophy has allowed these 
systems to support a broad 
array of new techniques 
and applications including 
packetized voice, stream- 
ing video and, of course, 
the myriad applications 
built atop the World Wide 
Web, under the leadership of Tim Berners-Lee, 
first at CERN, the European particle-physics 
laboratory near Geneva, now at the Massachu- 
setts Institute of Technology in Cambridge. 

Preservation of the open nature of the 
Internet is possibly the highest imperative to 
emerge from this decades-long enterprise of 
the research community, governments, the 
educational community and industry. The 
application space of the Internet knows no 
bounds. Recently, it has spawned a new round 
of design for an interplanetary Internet using 
new protocols that cope with the delay and 
disruption found in space communications. If 
you can imagine it and can program it, you can 
probably make it available on the Internet — a 
freedom to innovate that has its roots in the 
original ARPANET work.
Vinton G. Cerf is vice-president and chief 
Internet evangelist of Google at 1600 
Amphitheatre Parkway, Mountain View, 
California 94043, USA. From 1969 to 1972 he was 
a programmer working on the ARPANET. 
e-mail: vint@google.com


Reassessing the father of chemistry




Robert Boyle's character is often obscured by the shadow of Isaac Newton, but a masterful 
biography reveals him as larger than life, explains Peter Anstey. 


by Michael Hunter 
Yale University Press: 2009. 400 pp. 
£25, $55 


In the latter half of the seventeenth century, 
Robert Boyle (1627-91) was the leading 
natural philosopher in Britain. Yet although 
historians have been piecing together a more- 
detailed profile of him in the past three dec- 
ades, his popular image extends little beyond 
the law that bears his name and his most 
famous publication, The Sceptical Chymist. As
with his contemporaries Robert Hooke and
Christiaan Huygens, Isaac Newton's shadow
has obscured our view of Boyle. But previous
biographers must share the blame for Boyle’s
faded image, not least the first, Thomas
Birch. Writing in the 1740s with his collaborator
Henry Miles, Birch removed letters and
whole unpublished works from Boyle's papers
in order to perpetuate the anodyne image that
suited the polite tastes of the day.


The Age of Wonder: How the Romantic Generation
Discovered the Beauty and Terror of Science

by Richard Holmes (Harper Press, £9.99)

This award-winning book explores links between science
and Romanticism around the start of the 1800s. “To
guide readers through the science and culture of this
period, Holmes masterfully dips in and out of the life of
Joseph Banks,” wrote David Bodanis in his review of the
hardback edition (Nature 457, 31-32; 2009).

Nevertheless, there is no paucity of material 
with which a biographer can work. Indeed, 
the most impressive feature of biographer 
Michael Hunter's Boyle is the meticulous care
with which he has combed the vast quantity of
published and unpublished materials — includ- 
ing portraits, printed images and medallions 
— relating to Boyle's life. Hunter masterfully 
interweaves the narrative of Boyle’s intellectual
development and scientific achievements with 
a measured assessment of Boyle's diffident, 
even convoluted, personality. 

The tale begins with Boyle’s domineering 
and ambitious father, Richard, the Earl of Cork, 
and moves through his infancy, childhood and 
Eton school years. Then follows his Grand Tour 
of Europe, on which Boyle had seminal experiences
that were to shape his earnest Christian
faith and his early intellectual trajectory,
and from which he emerged as a precocious
adolescent.


Descartes’ Bones: A Skeletal History of the
Conflict Between Faith and Reason

by Russell Shorto (Vintage Books USA)
Russell Shorto interweaves the fate of philosopher
René Descartes’ bones with a narrative of
Cartesian philosophy and beliefs, also exploring
“the history of the uncomfortable relationship
between Catholicism and Cartesianism”, wrote
Lisa Jardine (Nature 455, 863-864; 2008).

Perhaps surprisingly, Boyle’s first exploits as
a writer were directed to moral and devotional 
topics. But at the age of 22 he was “transported 
and bewitchd” by experimental chemistry and 
never looked back. So began a life dedicated to 
the study of nature: a life that was funded by the 
substantial means he inherited from his father, 
and that is epitomized in the title of his popular 
later work The Christian Virtuoso. 

The most compelling chapters in Hunter's 
narrative cover Boyle's time in Oxford from 
the winter of 1655-56 and his emergence, in 
the early 1660s, as a celebrated public figure 
and emblem of the early Royal Society. These 
years were his most productive, both in terms 
of experimental results and written output: 
from 1660 to 1666, he published a dozen 
books at an average of 140,000 words per 
year. Other works took shape in this period, 
emerging in later decades; and still others have 
only recently been unearthed and published in 
the definitive 14-volume The Works of Robert 
Boyle (Pickering and Chatto, 1999-2000), of 
which Hunter is an editor. 

In his publications, Boyle introduced a 
new and distinctive natural philosophy called 
corpuscularianism. He also stressed the inter- 
play of theory and experiment in the construc- 
tion of natural histories, an approach that was 
to dominate British science for four decades. 
However, most significant was the series of 
innovative experiments Boyle performed with 
his air-pump, J-tube and long pipette. Through 
the clever manipulation of air and mercury and 
with careful measurement, he established that 
the pressure of the air is inversely proportional 
to its volume. Furthermore, he solved the long- 
standing problem in animal physiology as to 
the cause of air entering the lungs in respiration: 
there is a differential in air pressure between the 
expanded lungs and the atmosphere. 

Atomic: The First War of Physics and the Secret
History of the Atom Bomb, 1939-49
by Jim Baggott (Icon Books, £9.99)


Yet there is more to Boyle than the careful
experimenter. Hunter shows how in the eyes
of his contemporaries, from the royal court
to savants abroad, Boyle was a larger-than-life
character. This stemmed in part from his
overt religiosity, his reputation for professional
integrity and his understated philanthropy. But
it is the inner Boyle whom Hunter is most concerned
to explore: Boyle the doubter, the vacillator,
the stuttering and conscience-stricken
man revealed in private notes written near the 
end of his life. Hunter displays fascination and 
impartiality, even wavering respect, but in the 
final analysis it is not clear that he really likes 
Boyle. However, the biographer shows maturity 


by leaving the reader latitude to make up their 
own mind about what made Boyle tick. 

This first comprehensive work on the life of
Boyle is a piece of stunning scholarship, a command
performance by a gifted historian. It is
also a great read.
Peter Anstey teaches early modern philosophy at 
the University of Otago, Dunedin, New Zealand. 
He is the author of The Philosophy of Robert Boyle. 
e-mail: peter.anstey@otago.ac.nz 


Capturing digital


Total Recall: How the E-Memory Revolution 
Will Change Everything 

by Gordon Bell and Jim Gemmell 

Dutton: 2009. 304 pp. $26.95 


Delete: The Virtue of Forgetting in the 
Digital Age 

by Viktor Mayer-Schönberger
Princeton University Press: 2009. 
256 pp. $24.95 


We are entering an era of unprecedented 
digital power. Thanks to the diminishing cost 
of digital storage and the increasing ubiquity 
of digital devices, the 
day is coming when we
will be able to record 
almost every interaction 
we have. Digital micro- 
phones will capture our 
brief encounters with 
strangers; cameras 
will snap automatically 
as we enter new rooms 
or browse the web. And 
somewhere on a remote 
server farm, the images and 
sounds of our lives will pile 
up in a massive database. 
Vividly written and impressively researched, Atomic
covers the efforts of scientists and spies in the United
States, Britain, the USSR and Nazi Germany to develop
their own atomic weapon. Drawing on material including
declassified British secret-service transcripts and
documents from Soviet archives, this is a thorough but
engaging account of the race to build the atomic bomb.


On this at least, the authors of two
books about the social effects of digital
data storage, Total Recall and Delete,
agree. Where they differ is in what they think
will happen next. Pioneering computer scien- 
tist Gordon Bell and his Microsoft colleague 
Jim Gemmell take a libertarian view in Total 
Recall. Digital media will free us to dip back 
into the past at will, they argue. Equipped with 
information that our brains may have lost, 
we will act more effectively as individuals in 
every part of our lives. In Delete, by contrast, 
information-policy expert Viktor Mayer-Schönberger
believes people and technologies
are inextricably woven into the fabric of insti- 
tutions. Records of our personal data could 
easily make us vulnerable to the predations of 
governments and corporations, he warns. 
Bell, the first-person narrator 
of Total Recall, acknowledges the 
possibility that malefactors might 
abuse our information, but this is a 
long way from his primary concern. 
His book is largely a chronicle of 
the delight he has experienced in 
spending the past decade gathering 
data about himself, a process he calls 
“lifelogging”. At the beginning, this 
involved simply scanning documents 
and photographs onto a hard drive. 
Then he began to look for ways 
to capture his experiences as 
they happened. In 2001, this 
resulted in the birth of the 
MyLifeBits project at Microsoft
Research. Bell’s team,
which includes software designer
and co-author Jim Gemmell,
has developed a suite of digital
tools for recording, storing
and searching everything from
old family photographs to kerbside
chats.


The Black Hole War: My Battle with Stephen
Hawking to Make the World Safe for
Quantum Mechanics

by Leonard Susskind (Little, Brown, £12.99)
Leonard Susskind charts his long conflict with
Stephen Hawking over the fate of information in a
black hole. Paul Davies's review noted it “skilfully
explains the subtleties of the physics that underlie
the issue, and includes anecdotes to enliven the
technical details.” (Nature 454, 579-580; 2008.)

Today, Bell wears a ‘SenseCam’ 
that automatically takes pictures 
every time a sensor on the device 
registers something that he might 
want recalled: a warm body, or 
a change in light suggesting a 
change of place. His desktop 
computer records his every 
keystroke. When he travels, a 
portable Global Positioning 
System continuously reports his 
location to MyLifeBits, which 
among other things allows him to 
log the time, date and place of any 
images he takes. Occasionally, he 
likes to play back these images in 
rapid-fire succession. “Talk about 
your life flashing before your eyes!” 
he enthuses. 

With no shortage of hubris, 
Bell views the sort of tools he and 
his team are developing as evolu- 
tionary scaffolding. Humans are 
cursed with messy, organic brains 
that are inclined to forget, he argues. 
We have tried to mitigate this with 
memory technologies: first language, then 
writing, now computing. In fact, Bell asserts, 
“the arc of human development from the Stone 
Age through the present can be seen as an 
ongoing quest for Total Recall”. 

This, of course, is good news for Microsoft 
and the MyLifeBits team, who are developing 
the digital gear that makes such total recall 
possible. For all Bell’s visionary thunder, it is 
hard not to hear in his writing the voice of the 
corporate salesman. He and Gemmell have 
described an imminent techno-utopia in which 
the technology-enabled individual moves the 
entire species forward to a new stage of socio- 
technical evolution. Yet at ground level, the 
book mostly offers a simple, plain-spoken
account of how we can all become better at
everything if we first become better consumers
of digital technology.


Simply Complexity: A Clear Guide to
Complexity Theory

by Neil Johnson (Oneworld, £9.99)

The science of complexity is still a fledgling field, but one
that is on the rise. In this book, Neil Johnson introduces
complexity, explaining what it is and how it affects us,
before describing how complexity science can be used in
a number of ways, from fighting disease to relationships.
He also shows how, in the future, it may shed light on our
understanding of quantum physics and more.

Mayer-Schönberger greets this vision with
a polite and scholarly ‘humbug’. Like Bell, he
believes people will soon be able to capture 
almost limitless data about their lives. Unlike 
Bell, he does not equate memory with recall. 
Rather, he sees it as a process of reconstruct- 
ing the past in order to act in the present. 
Likewise, he suggests that individual agency 
is more than tool-enhanced willpower. It, too,
is a construction, built from both
personal and social resources.
From this perspective, Mayer-Schönberger
raises questions about
the power of technology and how
it affects our interpretation of time
— a topic that Bell largely ignores. He
dramatically undermines Bell’s case
for the value of being able to behold
all the times of our lives at once. He
draws on a rich body of contemporary
psychological theory to argue that both
individuals and societies are obliged
to rewrite or eliminate elements
of the past that would render
action in the present impossible:
a job applicant must put their
former drug addiction out of their 
mind, a nation must stop dwelling on 
the cruelty of its former enemies to 
make peace. Here lies the central irony 
of our contemporary technological 
situation: “Through perfect memory, 
we may lose a fundamental human 
capacity — to live and act firmly in the 
present,” he writes. 
Mayer-Schönberger also fears that
ubiquitous recall technologies would
give institutions the power to keep a
close eye on us at all times. As a result,
individuals would start censoring themselves
lest their actions and words be used
against them, leading to a kind of social stasis. 
For him, Bell’s proposed solution — security 
systems that allow users full control over their 
data — does not go far enough. He will only 
be satisfied if the devices that store the data 
are programmed to destroy it regularly and 
automatically. 

If Mayer-Schönberger is right — and I’m convinced
he is — then the old Kris Kristofferson
song might be true after all: in the future, freedom
could be just another word for nothing
left to lose.
Fred Turner is assistant professor of 
communication at Stanford University, Stanford, 
California 94305-2050, USA, and author of 
From Counterculture to Cyberculture. 
e-mail: fturner@stanford.edu 


Lewis Carroll in Numberland: His Fantastical 
Mathematical Logical Life 

by Robin Wilson (Penguin, £9.99) 

Although better known for his fiction, Lewis 
Carroll's achievements as a mathematician should 
not be overlooked. Robin Wilson's book “conjures 
the spirit of a man who delighted in paradox yet
insisted on precision ... and who wanted most of all
to stump everyone he knew”, wrote reviewer Jascha 
Hoffman (Nature 454, 580-581; 2008).


Explorer of the deep


Jacques Cousteau: The Sea King 
by Brad Matsen 
Pantheon: 2009. 336 pp. $27.95 


Pioneer of marine conservation Jacques-Yves
Cousteau — affectionately dubbed JYC, 
Captain Cousteau, Captain Planet, the Sea 
King or simply ‘the man with the red cap’ — is 
known worldwide for his exploration of the 
ocean and his success in popularizing its won- 
ders. Yet those who mistrust his fame often 
question whether his work was as valuable 
scientifically as he made out. 

Cousteau’s colourful life has already 
inspired several works in French and English, 
including two personal accounts by members 
of the crew of the RV Calypso, the legendary 
ship he used as an expedition vessel and 
research lab from 1950 until his death in 1997. 
The latest contribution is The Sea King by 
Brad Matsen, who has been writing books and 
documentary scripts about the sea and deep 
ocean for 30 years. Matsen pays only limited 
regard to Cousteau’s scientific achievements, 
although he claims to go further than all pre- 
vious biographies of the explorer, telling the 
complete story of his life and throwing new 
light on a complex personality. In this, at least, 
he largely succeeds. 

The book contains many little-known details 
of events from Cousteau’s life obtained from 
interviews with relatives and collaborators. 
Matsen accurately describes the creation of the 
aqualung, the first free-swimming underwater 
breathing equipment that Cousteau developed 
with the engineer Emile Gagnan in 1943, as 
well as his other technological contributions 
to underwater exploration. He describes many 
of the Calypso’s expeditions, dwelling at some 
length on the people who played a prominent
part in them alongside Cousteau. These
include Philippe Tailliez and Frédéric Dumas,
known with Cousteau as the three Mousquemers,
or musketeers of the sea; his first wife
and business partner Simone Melchior, also
nicknamed La Bergère,
or the shepherdess; his
two sons; and other
long-serving members of
the Calypso crew.


Leviathan: Or, the Whale

by Philip Hoare (Fourth Estate, £8.99)

Philip Hoare explores the whale and its significance
to humans. Meandering through biology, economics,
cultural history and his own obsession with the
creatures, he describes everything from the
possibility of us surviving in their bellies to gritty
details about the nineteenth-century whale trade,
concluding that much about the whale remains
mysterious.

Cousteau’s story is mostly 
one of groundbreaking adventure, but 
Matsen is not afraid to delve into the 
dark side of his life and character, 
something previous biographers 
have been reluctant to do. 
There is plenty to chew on: 
his sometimes difficult rela- 
tionships with his sons, 
his persistent financial 
problems, the fate of 
the Calypso (currently 
being refurbished in 
Brittany), the contro- 
versial role of his second wife Fran- 
cine and the death of his son Philippe 
in a seaplane accident in 
1979. He also chronicles 
the sad death from cancer 
of La Bergère, who was the
soul of the Calypso and had 
a key role in many of its 
voyages. 

One drawback of the book is that whereas 
Matsen covers most of the Calypso’s expedi- 
tions in detail, inexplicably he devotes only 
two lines to Cousteau’s Antarctic trip in 
1972-73. According to previous accounts, 
the Calypso’s crew considered this journey 
their most remarkable. It was one of the last 
on which Cousteau was personally present, 
directing operations with his wife Simone. 
It was also the most risky: the Calypso’s 
wooden hull was not built for navigation in 
pack ice. The films that the crew shot of the 
Antarctic environment were seen by millions 
and the expedition was probably decisive in 
Cousteau’s involvement in the Protocol on 
Environmental Protection to the Antarctic 
Treaty of 1991, which designated Antarctica 
as a nature reserve for 50 years.


The wide popularity of Cousteau’s 
films and television series may have 
been one reason why he was some- 
times accused of overplaying the scientific 
value of his work — a largely unfounded criti- 
cism born of jealousy and misunderstanding. 
True, Cousteau’s expeditions usually included 
professional scientists, and their contributions 
often came second to the storytelling. Yet he 
made a hugely important contribution to 
marine science; first, by developing technolo- 
gies that enabled people to make observations 
and carry out experiments in situ underwater; 
second, by the interest he aroused worldwide 
in the sea and the damage being done to it by 
humans; and third, by inspiring numerous 
young enthusiasts to become marine scientists 
and professional divers. 

Witness to Extinction: How We Failed to
Save the Yangtze River Dolphin

by Samuel Turvey (Oxford Univ. Press, £8.99)
Naturalist Samuel Turvey gives a personal account
of the 2006 survey that determined the baiji
dolphin was extinct. Describing it as a “godawful,
soul-destroying experience”, he touches on the
significance of the baiji’s extinction, local myths of
its origin, the failed preservation project and other
cetaceans such as the endangered vaquita.


Matsen, like so many others, is rather
seduced by Cousteau’s popular image and
ignores many of the scientific consequences
of his work. He could have done more to
highlight them. Most of Cousteau’s first expedi- 
tions on the Calypso were predominantly scien- 
tific, and during the first years of his ownership 
she was the only French oceanographic ship, 
offering scientists the possibility of making 
direct observations down to 300 metres for the 
first time. Sponsored by the French National 
Centre of Scientific Research, his expeditions 
to the Mediterranean, the Red Sea and the 
Atlantic resulted in numerous publications,
most of which are collected in the 11 volumes
of the series Résultats Scientifiques des Cam- 
pagnes de la Calypso, which contain important 
contributions to marine science. Cousteau was 
foremost an explorer, but his contribution to 
science was immense.
Jean Vacelet is a marine biologist at the Centre 
d'Océanologie de Marseille, Université de la 
Méditerranée, Marseille, France. He took part in 
several of the Calypso's scientific expeditions. 
e-mail: jean.vacelet@univmed.fr 


Darwin's puppy love 


Darwin's Dogs: How Darwin's Pets Helped 
Form a World-Changing Theory of Evolution 
by Emma Townshend 

Frances Lincoln: 2009. 144 pp. 

$14.95, £8.99 


Even the most ardent fan of Charles Darwin 
might be feeling weary as his anniversary year 
draws to a close. Publishers have seemingly 
explored every corner of Darwin's life: his 
youth, his marriage, his attitudes to slav- 
ery and religion. Emma Townshend adds 
a fascinating angle — Darwin's love of 
dogs. Dogs were Darwin’s constant com- 
panions from boyhood to old age. They were 
also the animals closest to hand when he 
explored the implications of his theories. It
is surely not coincidental that Darwin’s credo
was “it’s dogged as does it”. 

In Darwin’s Dogs, Townshend adds little
new to the Darwin biography. Yet her close 
reading of his correspondence, filtered to 
references to the family’s dogs, produces a 
warmer, more intimate portrait than others 
so far. She plausibly claims that, aside from 
his years at boarding school and on the aptly 
named ship HMS Beagle, Darwin spent every 
day of his life in the company of dogs. 

Creation: The True Story of Charles Darwin

by Randal Keynes (John Murray, £7.99)

Originally titled Annie’s Box, Randal Keynes's renamed
and re-released book Creation focuses on Charles
Darwin's relationship with his daughter Annie and how
her death subsequently affected his research. “Keynes
weaves a rich tapestry that gives the reader a sense of the
attitudes and assumptions of the Darwin family and their
class,” explained Bruce Weber in a review of the hardback
edition of Annie’s Box (Nature 411, 739-740; 2001).


Motherless at eight years of age and packed
off to boarding school, the young Darwin had,
by his own admission, a “passion” for dogs,
and his letters home to his three older sisters
are packed with affectionate banter about the 
animals. Writing of how much he missed his 
family’s dogs, and in turn being told by his 
sisters how much the dogs missed him, was a 
face-saving way for a young man to admit his
homesickness and exchange affection without
embarrassment. 

On his first morning back home after five 
years on the Beagle, Darwin went straight to 
the stables to see how his old “savage” dog, 
“averse to all strangers”, would react to his
return. Would the dog treat Darwin peaceably 
as befitted someone familiar, or would it growl 
at him showing that it had forgotten its mas- 
ter? As Darwin later recalled in The Descent 
of Man, the dog “obeyed me, exactly as if I
had parted with him only half an hour before.
A train of old associations, dormant during five
years, had thus been instantaneously awakened
in his mind”.

As Darwin's thoughts turned to ‘transmutation’
of species, the actions of dog breeders
intrigued him. By carefully selecting those
animals best suited to their purposes to form 
the parents of the next generation, breeders 
offered Darwin a metaphor — artificial selec- 
tion — from which he could derive his great 
guiding principle of natural selection. Dog 
breeders were especially important to Darwin 
in trying to understand the sources of pheno- 
typic variability and how varieties bred true to 
type — questions that were resolved long after
Darwin's death.

The other major issue with which
Darwin grappled on his return to the 
United Kingdom was that of finding a 
wife. There too, canine thoughts were never 
far from his mind. When he listed for himself 
the pros and cons of a married life, he noted 
the companionship of a wife to be “better 
than a dog, anyhow”. In Townshend's narrative,
we can see in this comment affectionate
praise rather than insult. 

On the Origin of Species opens with a
discussion of domesticated animals. When
Darwin came to his magnum opus on
humankind, The Descent of Man, and Selection
in Relation to Sex, dogs again have centre
stage. Dogs, for Darwin, know happiness
and sadness, grumpiness, kindness and loyalty.
They understand language — Darwin
pressed his neighbour, Sir John Lubbock,
into testing the latter’s dog’s vocabulary
— and they have a sense of humour. The
concept of property ownership, Darwin argued,
“is common to every dog with a bone”. With a
directness and candour that still shocks today,
Darwin mused that a dog's “deep love ... for his
master, associated with complete submission,
some fear, and perhaps other feelings” prefigures
human feelings of religious devotion.

Strange Fruit: Why Both Sides Are Wrong
in the Race Debate
by Kenan Malik (Oneworld, £10.99)
The subject of race is often controversial but, Kenan
Malik argues, we shouldn't avoid thinking about it. He
attempts to describe what race is and is not, from a
biological and cultural perspective, covering modern
disputes such as the US approval of a drug for African
Americans with heart disease. He also looks at
historical views on race and its treatment today.

Townshend shows a deft touch with a 
considerable body of Darwin scholarship. 
However, her simplified account of how 
scientific attitudes to dog behaviour have 
changed since Darwin is less secure. She mainly 
blames “behaviourists” in animal psychology 
for ruling inadmissible Darwin’s sympathetic 
attribution of human qualities to dogs, noting 
that anthropomorphism has been reinstated
in recent years by “cognitive psychologists”.
However, the rejection of anthropomorphism 
was not limited to behaviourists and encom- 
passed all forms of animal-behaviour study 
in the mid-twentieth century. Debate over 
the degree to which scientific terms used to 
describe human behaviour can be applied to 
animals continues to this day. 

All in all, Darwin’s Dogs is thoroughly entertaining
and informative. It is the ideal antidote
to Darwin fatigue.
Clive Wynne is associate professor of psychology 
at the University of Florida, PO Box 112250, 
Gainesville, Florida 32601, USA, and author of 
Do Animals Think? 
e-mail: wynne@ufl.edu 


Forgotten treasure seeker 


The Fossil Hunter: Dinosaurs, Evolution, and 
the Woman Whose Discoveries Changed 
The World 

by Shelley Emling 

Palgrave Macmillan: 2009. 256 pp.

Remarkable Creatures

by Tracy Chevalier 
Dutton/HarperCollins: 2009. 352 pp. 
$26.95/£15.99 


Until recently, histories of science were 
written almost entirely by, for and about men. 
The nineteenth-century hunt for Jurassic-era 
fossils along the beaches of the British town 
of Lyme Regis was no different. Although the 
names of naturalists such as Georges Cuvier, 
William Buckland and Richard Owen who 
used the fossils to overturn society’s ideas 
about life on Earth are familiar, that of Mary 
Anning is only beginning to be exhumed. The 
publication of two books about her life — one 
factual, one fictional — will raise her profile in 
the public imagination. 

Anning was a poor, working-class twelve-year-old
when she made her first major discovery
within the rocks of the perilous sea cliffs
in 1811: the first complete skeleton of an ich- 
thyosaur. She went on to uncover many other 
important fossils, such as the first plesiosaur 
and the first complete skeleton of the winged 
reptile Dimorphodon macronyx. Collecting and 
selling small fossils to earn a living, she also led 
fossil hunts for naturalists visiting Lyme Regis. 

Anning’s discoveries made it into the local 
newspapers. But it was the wealthy collectors 
and the established naturalists championing her 
finds in the halls of the Geological Society whose 
names became associated with them. Although 
her fossils helped overturn the popular idea 
that Earth and all its inhabitants were created in 
six days in 4004 BC, paving the way for Charles
Darwin's great synthesis in 1859, Anning wasn't 
mentioned in key publications or lectures. 

In her diligent biography The Fossil Hunter,
Shelley Emling explains that in Anning’s day
women had no place in the cut and thrust of
science. Urged not to appear outdoors without
a chaperone, women were barred from
places where learned debates took place and
were thought to lack the intellectual rigour or
stamina for fieldwork. Despite this, women
did make vital contributions. Anning shared
the beaches of Lyme Regis with three other
female fossil collectors — the middle-class
Philpott sisters, notably Elizabeth, who made
well-regarded finds. And Emling describes the
activity of two other talented nineteenth-century
women, the wives of geologists William
Buckland and Roderick Murchison, whose
contributions included sketching and labelling
of geological samples on expeditions.

The Earth After Us: What Legacy Will Humans

by Jan Zalasiewicz (Oxford Univ. Press, £8.99)
Using the imagined concept of extraterrestrial beings
examining Earth for evidence that humanity ever existed,
geologist Jan Zalasiewicz looks at what we might leave
behind in the geological record. Describing the evidence
we have for Earth's past, he explores our effects on
the world and puts them in perspective over the vast
timescale of the planet's history.

But Anning was more than a collector or
helper — she was a true scientist. She
reconstructed and cleaned her own finds.
She devoured scientific articles, often
painstakingly copying out the entire text
and figures. She engaged in spirited
discussions with the men who sought her
expertise and her samples. She dissected
living sea creatures on the kitchen table to
better understand the anatomy of their
long-dead counterparts. She even conducted
research, surmising, for example, that the
rock-like bodies she often found within
the skeletons she uncovered — coprolites —
were hardened faeces. Together with William
Buckland, she reconstituted coprolites in her
workshop and deduced what the animals had
been eating.

Like many retrospective narratives, Anning’s
story has its heroes and anti-heroes, set-pieces
and eureka moments. In Remarkable Creatures,
Tracy Chevalier uses these devices to construct
a fictionalized account of Anning’s life. Able to
make things up when the details are shrouded
in obscurity, Chevalier’s engaging version
easily wins out over Emling’s more faithful
biography. Chevalier’s unconstrained hand
lets one suspend disbelief, such as in “[Buckland]
asked so many questions... that I began
to feel like a pebble rolled back and forth in
the tide”. By contrast, confined by the facts,
Emling’s wearing reliance on the conditional
perfect — in sentences such as “The bracing
early-morning air would have invigorated
Mary’s senses” — soon begins to grate.

Deep Economy: Economics as if
the World Mattered
by Bill McKibben (Oneworld, £9.99)
Bill McKibben calls for a new focus on developing
local, rather than global, economies — advocating
that cities and regions should produce more of
their own food, energy and culture. Such small,
local economies, he argues, offer a greater sense of
community and satisfaction, and better protection
against an increasingly uncertain future.


NATURE|Vol 461/29 October 2009


AUTUMN BOOKS OPINION

Emling’s biography is the more thorough 
and complete work. It also frees us from the 
claustrophobic atmosphere of Lyme Regis, 
providing the context of discoveries happen- 
ing elsewhere. Chevalier’s tale glosses over 
most of Anning’s later life. But so accurate was 
her fictional rendering that I felt I was reading
the same book twice. Both works did, however,
lack a gripping plot. Emling’s solution was to 
incorporate peripheral dramas, such as natu- 
ral disasters befalling Lyme Regis. Chevalier’s 
strategy was to sneak Elizabeth Philpott into a 
key session of the Geological Society and to
give Anning a putative lover. 

In the end, Anning’s life story can offer no 
more than a pleasant but sedate read, either 
in fact or fiction. More evocative was the
drama, brought out well in both works, of
how her discoveries shook the world: leering, 
nightmarish monsters materializing from the 
clay and hinting at a world far more ancient, 
savage and uncaring than anyone could 
possibly have imagined.
Jennifer Rohn is a cell biologist at University 
College London, UK, and editor of LabLit.com.
Her first novel is Experimental Heart.
e-mail: jenny@lablit.com 


History of the hard stuff 


Uncorking the Past: The Quest for Wine, 
Beer, and Other Alcoholic Beverages 

by Patrick E. McGovern 

University of California Press: 2009. 
348 pp. $29.95, £20.95 


Barley, wheat and grapes in the Middle East; 
rice, millet and hawthorn fruit in China; figs 
and dates in the Levant; sorghum and palm sap 
in Africa; maize, cacao, cactus fruit, manioc and 
pepper-tree fruit in the Americas; and every- 
where, honey. All these substrates were used by 
early humans in their quest for alcohol. 

In Uncorking the Past, biomolecular 
archaeologist and University of Pennsylvania 
museum director Patrick McGovern argues 
that the desire for alcohol is innate in humans 
and other primates. Moreover, he believes that 
“the uniquely human traits” of self-conscious- 
ness, innovation, the arts and religion have 
been “encouraged by the consumption of an 
alcoholic beverage”. This is a difficult proposi- 
tion to prove. In trying to do so, McGovern 
takes his reader on a world tour, examining the
archaeological record for alcohol use across 
continents and cultures, searching for common 
themes that are indicative of universal use. 

The earliest pottery artefacts with identifiable
residues of a fermented beverage — arising
from a mixed fermentation of rice, honey and
hawthorn fruit — were found in China and
date to 7000 BC. Three thousand miles away,
in the Zagros mountains in western Iran,
pottery dating to 3500 BC has been found
with residue of tartaric acid, indicating
wine storage, as well as containers with a
calcium oxalate ‘beerstone’ residue from
barley beer. In Asia, Europe and the
Americas, archaeologists have unearthed
buildings that were constructed for the
production and storage of various alcoholic
beverages. And fermentation vessels and
elaborate drinking sets are found in tombs of
the rich and powerful across the world.
Clearly, alcohol has been a part of human
civilization for millennia. But has it played
a part in the development of human culture?

McGovern narrates his thoughts in the first
person, as if relating them to friends over
a drink. He intertwines his own research
findings — detailed in his earlier book,
Ancient Wine — with those of others, and
tells stories of quests to recreate ancient
beverages. He describes tasting ‘Chateau
Jiahu’, a modern recreation of the earliest
fermentation discovered in China; and the
‘Phrygian Grog’ he named ‘Midas Touch’, a
fermented beverage based on wine grapes,
honey and malted barley. The residues of
this were found in bronze containers in the
burial chamber of a Phrygian king, perhaps
Midas, near present-day Ankara. For a general
reader, the blend of fact and personal
narrative is enticing, reminiscent of the
mixed fermentations practised by our
Neolithic ancestors; but some academics,
thirsty for footnotes, may wish he had chosen
a more traditional form.

Blessed Days of Anaesthesia: How Anaesthetics
Changed The World
by Stephanie J. Snow (Oxford Univ. Press, £9.99)
Stephanie Snow explores how early advances in
anaesthetics changed society. “[This] is not a real medical
history, nor is it seriously concerned with medicine or
society beyond England and Scotland. But it seeks to
link developments in anaesthesia with changing social,
philosophical, scientific and religious attitudes in those
countries,” wrote John Carmody (Nature 456, 38; 2008).

The New Plagues: Pandemics and Poverty in
a Globalized World
by Stefan Kaufmann (Haus Publishing, £9.99)
An accessible and up-to-date look at diseases that
are on the rise thanks to increased globalization.
Describing the various strategies that agents such
as microorganisms or prions might adopt, Stefan
Kaufmann delves into the conflict between rich
and poor in combating outbreaks, and looks at
methods for containment.

McGovern begins by speculating about 
the role of alcohol in the Palaeolithic period, 
suggesting that its shamanic use, alongside 
other drugs, helped to develop religion and 
art — a proposition that is impossible to test 
conclusively. Neolithic and early Bronze Age 
cultures produced pottery and metal vessels, 
from which residues can be analysed. In most 
cases, the first fermentations were mixtures of 
grains, honey and wild fruit. Grains presented 
a problem to early brewers because the starch 
in the grains had to be converted to sugar 
before fermentation could begin. This was 
solved in various ways in different cultures: 
the use of enzymes in human saliva to break 
down starches is still applied in Africa and the 
Andes; malting and kilning of the grains is 
another technique, raising the possibility that 
beer came before bread; and the use of mould is 
often found in Asian rice-based brews. 

Mixed fermentations, starting with higher 
sugar concentrations and natural yeast derived 
from honey and fruits, resulted in beverages of 
higher alcohol content than those based solely 
on grains. As cultures gained experience, most 
moved to single fermentations — beer, fruit 
wines or mead — with one type of beverage 
gaining dominance. The social importance 
of these beverages is reflected in the elaborate 
nature of fermentation vessels and drinking 
sets found in tombs in Asia, Europe and the 
Americas. Alcohol’s widespread use is attested 
in paintings on vessels depicting communal 
sipping of one drink through shared straws, a 
scene repeated across many cultures. 

The most powerful argument for alcohol
as a force for innovation and social development
is the claim that the initial domestication
of many grains was “motivated by a desire to
increase alcoholic-beverage production”,
rather than to provide more food. However,
only one example is explored in the book,
namely the suggestion that humans were
originally attracted to teosinte grass and
maize (corn) because of the fermentable
sweet syrup in their stalks, and that our
selection of the seed kernels of these plants
for food followed only afterwards.

Decoding the Heavens: Solving the Mystery of the

by Jo Marchant (Windmill Books, £8.99)
The 2,000-year-old Antikythera Mechanism was found
in 1901, but its significance was only recently revealed.
“[This] gripping and varied account will propel the
mechanism to greater fame, although it may never
achieve the celebrity of the Rosetta Stone that it probably
deserves,” argued Andrew Robinson in his review of the
hardback edition (Nature 455, 867-868; 2008).

The broader case centres on alcohol’s 
perceived ability to spark creativity in some
individuals, encouraging them to progress
beyond tradition. In the words of William 
James: “Sobriety diminishes, discriminates, 
and says no; drunkenness expands, unites 
and says yes.” Although that general argument 
resonates with this reviewer, McGovern doesn't
give any specific examples of social advance- 
ment through alcohol consumption. 
Uncorking the Past doesn't prove McGovern’s 
thesis that alcohol has been a significant force in 
human development, but it does demonstrate 
that fermented beverages have been incorpo- 
rated into the fabric of society for millennia. For 
some, taking a ‘cup of kindness’ may be a ticket
to altered consciousness; for most, it invites
sociability through temporary effects on our
limbic system. Both outcomes are sorely
needed in today’s society.
Jim Lapsley is adjunct associate professor 
in the Department of Viticulture and
Enology at the University of California, Davis, 
1 Shields Avenue, Davis, California 95616-5270, 
USA. 
e-mail: jtlapsley@ucdavis.edu 


Living by the calendar 


The Seasons of Life: The Biological Rhythms 
that Living Things Need to Thrive and 
Survive 

by Russell Foster and Leon Kreitzman 
Profile Books/Yale University Press: 2009. 
320 pp. £20/$28 


Much of biology is governed by the seasons. 
Reindeer seasonally adjust the colour of 
their eyes for better vision; newborn warb- 
lers are programmed to fly from Europe to 
an unknown destiny in Africa; hibernators 
turn down their internal thermostat for six 
months of the year. Most biologists would 
jump to unravel such seasonal feats if the 
time constraints were not so forbidding. 
Russell Foster and Leon Kreitzman lament 
in their latest book the slow pace of research
on annual rhythms in biology. Yet their
fascinating story impresses with its wealth of 
facts and splendid overview. 

The Seasons of Life follows on from the 
authors’ previous collaboration, The Rhythms 
of Life (Profile Books, 2004). Russell Foster, 
a professor in circadian neuroscience at the 
University of Oxford, UK, is an eminent 
scientist in the field of chronobiology, and a 
passionate one. He helped to discover specific 
ganglion cells in the mammalian retina that 
perceive light intensity and are instrumental 
in synchronizing biological clocks. Kreitz- 
man is a science journalist with a lucid pen. 
Together, they paint a broad perspective on 
the functions and mechanisms of biological 
calendars. 

The authors dedicate five chapters to
the adaptation of animals and plants to the
seasons, and six to human seasonality.
This preference for humans is unexpected,
but appropriate and stimulating. Research
interest in human seasonality has been
considerable but necessarily of a
descriptive nature. The annual
rhythm of human reproduction,
for example, is well known and has
been recorded extensively in birth
records. Evolutionary zoologists
can only dream of having similar
vast data sets.

The Philosophical Baby: What Children's
Minds Tell Us About Truth, Love, and the
Meaning of Life
by Alison Gopnik (Bodley Head, £14.99)
Alison Gopnik's well-written book is an unusual
look at the conscious mind. Drawing on research
and her own pioneering studies, she reveals that
processes in a baby's mind can be as complicated
as those in the minds of adults, and asks what
brain development can teach us about humanity.

Until recently, the consequences
of birth date for human characteristics
were a theme for astrologers
rather than scientists. Database
analyses now show that the inci- 
dence of a host of diseases later in 
life, such as schizophrenia, multi- 
ple sclerosis and suicidal behaviour, 
varies with the season of birth. This 
seasonal variation, the authors argue, 
holds the key to understanding the impact 
of environmental effects during 
gestation and early postnatal
life on adult health and lifespan.

Foster and Kreitzman mix sugges-
tive new facts with recently recovered 
old references: for instance, the 
Babylonian king Hammurabi rec- 
ommended using sunlight in the 
treatment of illnesses nearly 4,000 years
ago, pre-empting recent reports of 
the alleviation by light of depres- 
sive symptoms in seasonal affec- 
tive disorder. The authors lead 
the reader into the literature on 
the systematic seasonal variations 
in suicide, on general mortality and 
on violence. 

Seasonal phenomena in plants and
animals are more readily approached
by experimentation than are those in
humans. The book tells the story of
the discovery of photoperiodism — the
study of the physiological changes in
reaction to the length of daylight — first
by Wightman Garner and Henry Allard
in plants in 1920, and then by William
Rowan in songbirds in 1925.

The authors also detail the
finding of innate circannual clocks:
endogenous seasonal rhythms that per- 
sist even in constant temperature and 
day length with a usual cycle length of 
around 300 days rather than 365 days. 
Circannual clocks were first found in 
hibernating ground squirrels by Eric Pen- 
gelley in 1966, and in seasonally migrat- 
ing songbirds in 1967 by Ebo Gwinner, 
the influential ornithologist to whom the 
book is dedicated. In the 40 years since 
then, significant progress has been made 
by only a few labs. The physiology of the 
circannual pace- 
maker in the Soay 
sheep, for exam- 
ple, is becoming 
better understood 
through studies by Ger- 
ald Lincoln and David 
Hazlerigg at the Centre 
for Reproductive Biology
in Edinburgh, UK.

Foster and Kreitzman
have produced a tantaliz- 
ing account of the facts 
behind seasonality. Its occasional
nickname, ‘nature’s
contraceptive’, reflects the key
function of seasonal organi- 
zation: thousands of species 
across the globe, including 
those in the tropics, use 
seasonality to turn off 
reproduction at times 
of year when low 
food supply is 
expected and 
individual fitness 
is better served by 
waiting for the next season. 
The Seasons of Life is a joy to read, and a
compelling text on the importance of seasonality
in the evolution of life on Earth.
Serge Daan holds the honorary Niko 
Tinbergen chair in behavioural biology at the 
University of Groningen, PO Box 72, 9700 AB 
Groningen, the Netherlands. 
e-mail: s.daan@rug.nl


Unmeasurable verse


Physicist and author Alan Lightman’s latest
work is a book-length poem. In Song of Two
Worlds, he writes from the perspective of
a man reassessing his life after a tragedy.
Lightman splits his epic into two sections; in
the first, he marvels at the measurable world,
the glory of geometry and fact. In the second,
he explores the unmeasurable, the pleasure
and pain of love, the beauty of a sunset and the
night sky. An excerpt from the latter section is
reproduced here.


Excerpt from Song of Two Worlds

I am a fragment
That hurtles through space
While the breeze of the universe
Ruffles my hair.

Evening. I gaze
Through my telescope,
Searching the colors of stars.
Some are the hues of goats’ wool,
Some ochre olive,
Or pink bougainvillea.

In chasms of space
I see stars born from gases,
Great thrumming furnaces oozing their heat,
Convective motions, electron opacities —
Elsewhere stars dying,
Cold cinders
Or giant explosions, eruptions of light,
Cities consumed in a nuclear blast,
Billions of years dimmed in a second.

I have learned
That the heavens are violent and fragile
And doomed to destruction,
Just as this thimble the earth.
All in the cosmos is failing,
And nothing remains,
And we measure the hour of the stars,
As I measure one morning's light.

Here, in the glass of this eyepiece.


Song of Two Worlds
by Alan Lightman
A.K. Peters: 2009. 112 pp. $24.95


Eating The Sun
by Oliver Morton (Fourth Estate, £9.99)
There are few books on photosynthesis for the non-
specialist. Eating the Sun fills that gap, covering the history
of its discovery and its processes. “Morton's account of
the ubiquitous importance of photosynthesis is an original
viewpoint for looking at the world. It is written with verve
and an eye for detail. His breadth of scholarship could
leave other science writers green — with envy,” wrote
reviewer Richard Fortey (Nature 449, 284-285; 2007).

Scientists Can't Predict the Future
by Orrin H. Pilkey and Linda Pilkey-Jarvis
(Columbia Univ. Press, £15.50)
Reviewer Roger Pielke Jr wrote: “The authors
have identified a critical challenge confronting
the modern scientific enterprise: our ability to
produce model-based predictions seems to have
outpaced our ability to use such tools wisely in
decision-making.” (Nature 447, 35-37; 2007.)


QUANTUM INFORMATION


Caught at the finishing line 


Bob B. Buckley and David D. Awschalom 


Quantum systems habitually leak information, limiting their usefulness for practical applications. 
By optimally reversing the leak, this information loss has been reduced to a trickle in the solid state. 


The physical interactions described by quantum 
mechanics are fundamental to describing the 
world. Harnessing these quantum interactions 
has the potential to add a new and powerful 
set of tools for quantum information technology¹.
Specifically, processing information
using ‘quantum bits’ — qubits — is like add- 
ing the power of brightness and colour to each 
black and white (1 and 0) bit used for digital 
computation today. Unfortunately, just as a 
brightly coloured picture fades in the sun, so 
these quantum bits of information fade quickly 
through a process called decoherence². In this
issue, Du and colleagues³ (page 1265) describe
their work to minimize this detrimental effect 
in a solid-state system of electron spins (qubits) 
using a recently developed method that relies 
on repeatedly flipping the spins at well-defined 
intervals of time. The decohering mechanisms 
in the system reverse in sign when the spins 
are flipped, allowing most of the decoherence 
to simply subtract itself away. 

The process that allows qubits to commu- 
nicate in a useful manner also allows them to 
easily lose the information they encode. This 
information loss (decoherence) occurs when
the system interacts with itself or its surroundings
through uncharacterized, random or
fluctuating processes. For an ensemble of quan- 
tum objects, decoherence can manifest itself in 
two ways. First, the system's surroundings can 
vary in space, making two seemingly identi- 
cal qubits perform differently through their 
interactions with dissimilar surroundings. And 
second, the surroundings can fluctuate in time, 
making the same qubit perform in different 
ways at different times. It has been shown that 
decoherence must be below a certain threshold 
for quantum computation to be feasible⁴. Thus
far, it has proved difficult to physically realize 
a system below this threshold limit. Decoher- 
ence is therefore a central roadblock to prac- 
tical quantum computation. Finding ways to 
understand and control this effect is vital to the 
future of quantum information processing. 
Consider each qubit in the ensemble as a 
runner in a race (Fig. 1). For the purposes of 
quantum information, it is ideal for each run- 
ner to perform identically, with all runners 
crossing the finishing line simultaneously. In 
reality, some runners are faster than others. 
If one wanted them all to finish roughly at
the same time, the rules of the race could be
changed so that when the race was half over, 
all the runners would have to turn around 
wherever they were and begin running back 
to the starting/finishing line. The more athletic 
members of the pack would have progressed 
farther and would therefore have to run just 
as fast as before in order to tie with their less 
athletic companions. This kind of reversal race 
can be performed on quantum systems and is 
called the Hahn echo⁵. It corrects for both spatial
variations and variations in time that are
slower than the full duration of a single race.
Of course, in reality runners do not keep a 
constant pace throughout a race. Some speed 
up, some slow down and some fluctuate in 
speed as the race progresses. By reversing 
the runners’ direction multiple times during 
the race, it is possible to minimize the effects 
of their inconsistent pace, provided their 
speed does not fluctuate much faster than the 
frequency of direction flips. Analogously,
with an appropriately timed set of spin-
reversing pulses, Du and colleagues³ were able
to reduce decoherence and extend the useful
life of their qubits of choice, an ensemble of
electron spins in a solid-state system (a crystal
of malonic acid).

Figure 1 | A quantum race. Quantum bits in a quantum system
decohere, and so lose the information they encode, much as runners
in a race spread out as the race progresses. a, Both fast and slow runners
can finish the race together by flipping everyone's running direction
halfway through the race. The faster runners are consequently farther
from the starting/finishing line halfway through and must run just as
fast as before to tie with their slower colleagues. This approach works
provided the runners do not speed up or slow down too much during
the race. b, By flipping the runners’ direction multiple times
throughout the race, inconsistent runners can be made to almost tie,
provided their speed changes are less frequent than the direction
flips of the race. Du and colleagues³ use an analogous optimal flipping
process to repair the inconsistencies in their quantum system that lead
to information loss.
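
The running-race picture can be turned into a small numerical sketch. The Python below is an illustration only, not the procedure used by Du and colleagues. Each 'runner' is a spin whose precession rate is drawn at random, every flip reverses the sign with which later precession adds to the accumulated phase, and the ensemble coherence is taken as the length of the averaged unit phasor. The function names, the Gaussian spread of rates and the linear drift model are all assumptions made for this example.

```python
import cmath
import random


def accumulated_phase(rate, flip_times, total_time=1.0, steps=400):
    """Midpoint-rule integral of sign(t) * rate(t) from 0 to total_time.

    sign(t) starts at +1 and reverses at every flip, like runners who
    turn around each time the whistle blows.
    """
    dt = total_time / steps
    phase = 0.0
    for k in range(steps):
        t = (k + 0.5) * dt
        flips_so_far = sum(1 for f in flip_times if f <= t)
        sign = -1.0 if flips_so_far % 2 else 1.0
        phase += sign * rate(t) * dt
    return phase


def coherence(rates, flip_times, total_time=1.0):
    """Length of the ensemble-averaged unit phasor: 1 means a perfect tie."""
    phasors = [cmath.exp(1j * accumulated_phase(r, flip_times, total_time))
               for r in rates]
    return abs(sum(phasors) / len(phasors))


rng = random.Random(0)  # fixed seed so the sketch is repeatable
n_spins = 200

# Hypothetical ensembles: constant rates, and rates that drift linearly in time.
steady = [(lambda t, c=rng.gauss(0.0, 20.0): c) for _ in range(n_spins)]
drifting = [(lambda t, c=rng.gauss(0.0, 20.0), b=rng.gauss(0.0, 20.0): c + b * t)
            for _ in range(n_spins)]

for label, rates in (("steady", steady), ("drifting", drifting)):
    for flips in ([], [0.5], [0.25, 0.75]):
        print(label, flips, round(coherence(rates, flips), 3))
```

In this toy model a single flip at the halfway point cancels any constant difference in rate, whereas a rate that drifts steadily during the race needs the two-flip schedule before the runners finish together.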

The idea of using pulse sequences to reduce 
decoherence is not new. Besides controlling 
electron spins⁶ in systems such as that of Du and
colleagues, pulsing techniques have a rich history
in nuclear (spin) magnetic resonance. The 
optimal pulse sequence studied by Du et al. was 
discovered only recently by Götz Uhrig⁷. It was
then generalized⁸,⁹ for systems similar to that
used by Du and colleagues and subsequently
performed on trapped ions¹⁰. The decoherence
corrections of the Uhrig pulse sequence scale 
linearly with the number of pulses applied. It 
therefore takes N pulses of flipping the spin to 
realize a decoherence suppression up to order 
N. Many of the predecessors of Uhrig’s pulse 
sequence require an exponentially increasing 
number of pulses to produce the same sup- 
pression effect, requiring prohibitively large 
numbers of spin flips to suppress high orders of 
decoherence. The Uhrig pulse sequence has the 
ability to suppress high orders of decoherence 
with a feasible number of spin-flip pulses. 
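
As a rough illustration of the timing idea, the sketch below computes pulse times for an N-pulse sequence of total duration T using the formula commonly quoted for Uhrig's sequence, t_j = T sin²(πj/(2N + 2)), next to evenly spaced pulses for comparison. The formula is not given in this article, and the function names and the unit-length sequence are assumptions made purely for illustration.

```python
import math


def uhrig_pulse_times(n_pulses, total_time):
    """Pulse times t_j = T * sin^2(pi * j / (2N + 2)) for j = 1..N.

    This is the textbook form of the Uhrig sequence; it is an assumption
    here, since the article itself does not state the formula.
    """
    return [
        total_time * math.sin(math.pi * j / (2 * n_pulses + 2)) ** 2
        for j in range(1, n_pulses + 1)
    ]


def equidistant_pulse_times(n_pulses, total_time):
    """Evenly spaced pulses t_j = T * (j - 1/2) / N, shown for comparison."""
    return [total_time * (j - 0.5) / n_pulses for j in range(1, n_pulses + 1)]


# Seven pulses, as in the seven-pulse scheme mentioned in the text.
for t_uhrig, t_even in zip(uhrig_pulse_times(7, 1.0), equidistant_pulse_times(7, 1.0)):
    print(f"Uhrig: {t_uhrig:.4f}   evenly spaced: {t_even:.4f}")
```

With a single pulse both schedules reduce to one flip at the midpoint, which is the one-pulse Hahn echo; for larger N the Uhrig pulses crowd towards the start and end of the sequence.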

Du and colleagues explored a number of
Uhrig pulse sequences in their system. Using 
a seven-pulse Uhrig control scheme, they were 
able to improve the system’s coherence time by 
a factor of 750 compared with no control, and 
by a factor of 5 compared with the one-pulse 
Hahn-echo scheme. For up to seven pulses, 
they compared the Uhrig pulse scheme with 
other previously known pulse schemes and
showed an improvement in coherence times. 
By varying parameters within the sample, 
they were also able to isolate and study various 
sources of decoherence present in their system. 
Characterizing decoherence in similar ways in 
future systems could help to identify, quantify 
and minimize its specific forms. 

Quantum control schemes such as that 
used by Du and colleagues should prove to be 
a valuable asset in understanding and attack- 
ing the decoherence of quantum informa- 
tion. These achievements are vital to pushing 
the performance of real, physical systems 
closer to that required for practical quantum 
computing.
Bob B. Buckley and David D. Awschalom are
in the Center for Spintronics and Quantum
Computation, University of California,
Santa Barbara, California 93106, USA.
e-mails: awsch@physics.ucsb.edu;
buckley@physics.ucsb.edu

MATERIALS SCIENCE


Emerging routes to multiferroics 


Ramamoorthy Ramesh 


Materials that combine ferroic properties — such as ferromagnetism and 
ferroelectricity — are highly desirable, but rare. A new class of multiferroic 
solids heralds a fresh approach for making such materials. 


The porous crystalline materials known as 
metal-organic frameworks (MOFs) are cur- 
rently a hot topic of research, having found 
applications in catalysis¹, hydrogen storage²,
optical elements³ and more. Writing in the
Journal of the American Chemical Society, Jain
et al.⁴ describe a family of MOFs that have yet
another potentially useful characteristic. They 
are multiferroic, combining the ferromag- 
netism familiar from iron bar magnets with 
antiferroelectricity — a property in which the 
molecules of a material are ordered so that 
adjacent molecular dipoles point in oppo- 
site directions. Multiferroics are attractive 
candidates for use in electrically controllable 
microwave elements, magnetic-field sensors 
and possibly even in spintronics. 

Ferroic properties come in several forms, 
referred to as order parameters, all of which 
manifest themselves around some critical 
temperature. The primary order parameters 
are ferromagnetism, ferroelectricity (sponta- 
neous electric polarization that can be reversed 
by an electric field) and ferroelasticity (spon- 
taneous strain). But many other flavours also 
exist, including antiferroelectricity. Any mater- 
ial that combines more than one of these prop- 
erties is described as multiferroic. If the order 
parameters of a multiferroic are coupled to
one another, then each can be manipulated by
the application of a conjugate field from the 
other. For example, in magnetoelectric materi- 
als, the magnetic moment of the material can 
be manipulated with an electric field, or the 
electric moment with a magnetic field. Such 
materials are of great fundamental scientific 
interest, and are also highly desirable for 
several applications. 

Recent years have therefore seen a con- 
siderable worldwide effort to discover broad 
classes of multiferroics, using a combination 
of materials chemistry, theoretical approaches 
and synthetic techniques. Unfortunately, it is 
becoming increasingly clear that the electronic 
structures of molecules that are required for 
ferromagnetism and ferroelectricity tend to 
be mutually exclusive. Ferromagnetism typi- 
cally requires unpaired electrons that interact 
through a quantum-mechanical process known 
as exchange coupling. But typical ferroelectric 
materials (such as barium titanate, BaTiO3,
and other structurally related ‘perovskite’ com- 
pounds that contain transition metals) require 
the transition-metal ion to have an empty outer 
shell of electrons. This fundamental contrast 
is the main reason why few materials are both
ferroelectric and ferromagnetic. In the materials
that do have both of these order parameters, 
one is usually much weaker and appears at 
much lower temperatures than the other. 

Most researchers have adopted four main 
approaches to try to make better magnetoelectric
materials⁵⁻⁷ (Table 1). All of these attempt
to introduce ferroelectricity into magnetic 
materials, using various mechanisms to side- 
step the fundamental mismatch described 
above. Magnetoelectrics have also been created 
through a composite approach — by mixing a 
ferroelectric and a ferromagnet in such a way 
that strain is the coupling medium. Perhaps 
the most promising magnetoelectric material 
so far is bismuth ferrite (BiFeO3, a perovskite),
in which the two coupled order parameters 
are ferroelectricity and antiferromagnetism. 
Bismuth ferrite sets the benchmark in the 
global search for new magnetoelectric materi- 
als, guiding the design of possible multiferroic 
architectures, and informing the use of theoret- 
ical approaches that seek truly ferromagnetic 
ferroelectrics. 

Jain and colleagues’ MOF compounds⁴,
however, represent a completely new class of 
multiferroics — previously reported magneto- 
electric perovskites were purely inorganic 
compounds, but MOFs are hybrid structures 
comprising metal ions in complex with organic 
molecules. The presence of organic molecules 
in the structures allows hydrogen bonds to 
form between the MOF’s components. It is 
these bonds that are responsible for ordering 
Jain and colleagues’ MOFs in such a way as to 
engender multiferroic properties — a first in 
the field. The authors previously identified⁸
an antiferroelectric MOF structure that contained
zinc ions (Zn²⁺). By replacing the zinc
with magnetic transition-metal ions, such as
iron(II) ions (Fe²⁺), they were able to make
multiferroic materials.

The authors observed that, on cooling, their 
multiferroic MOFs undergo a transition from 
a paraelectric phase (in which the materials 
become temporarily electrically polarized in 
an external electric field) to an antiferroelectric 
phase, at critical temperatures ranging from 160 
to 180 kelvin. This corresponds to a change in 
the molecular structure of the MOF from a dis- 
ordered to a more ordered state. It is certainly 
nice to see that hydrogen-bonding effects can 
lead to relatively high transition temperatures. 
However, Jain et al. found that this transition is 
unaffected by a magnetic field, and so it is quite 
likely that there is no magnetoelectric coupling
associated with it. But at temperatures below 10
kelvin, the compounds undergo a transition to
another phase that is both antiferroelectric and
weakly ferromagnetic. The fact that the transition
to the magnetic state occurs at much lower
temperatures is typical of ferromagnetic MOFs;
this is a consequence of the indirect nature of
the quantum exchange interactions between
electrons that occur in these compounds.

TABLE 1 | MECHANISMS FOR MULTIFERROICS

Mechanism              Description                                                Examples

Lone-pair effects      In perovskites of general formula ABX3, lone pairs of      BiFeO3, BiMnO3
                       electrons on the A cation distort the geometry of the
                       BX3 anion, resulting in ferroelectricity

Geometric frustration  Long-range dipole-dipole interactions and rotations        YMnO3
                       of oxygen atoms generate a stable ferroelectric state

Charge ordering        Certain ‘non-centrosymmetric’ arrangements of ions         LuFe2O4
                       induce ferroelectricity in magnetic materials

Magnetic ordering      Ferroelectricity is induced by magnetic long-range         TbMnO3, DyMnO3,
                       order in which the arrangement of magnetic dipoles         TbMn2O5
                       lacks reflection symmetry

Jain and colleagues’ compounds⁴ are a
great start. But for practical applications, the 
transition temperatures of these multiferroic 
MOFs will need to be increased to around 
room temperature, and the strength of the 
coupling between the two order parameters 
must be increased. The beauty of MOFs is that 
their structures can easily be modified, which 
should, in principle, allow the properties of 
the compounds to be readily optimized. In the 
meantime, Jain and colleagues’ findings dem- 
onstrate the important principle that electrical 
order in multiferroic materials can arise from
hydrogen bonding. What’s more, in a field
dominated by materials that contain the toxic
element lead, MOFs open up fresh opportunities
for the production of lead-free multiferroic
compounds tailored for specific technological
applications.
Ramamoorthy Ramesh is in the Department
of Materials Science and Engineering, and the
Department of Physics, University of California,
Berkeley, Berkeley, California 94720, USA.
e-mail: rramesh@berkeley.edu


1. Farrusseng, D., Aguado, S. & Pinel, C. Angew. Chem. Int. Edn 48, 7502-7513 (2009).
2. Rowsell, J. L. C. & Yaghi, O. M. Angew. Chem. Int. Edn 44, 4670-4679 (2005).
3. Allendorf, M. D., Bauer, C. A., Bhakta, R. K. & Houk, R. J. T. Chem. Soc. Rev. 38, 1330-1352 (2009).
4. Jain, P. et al. J. Am. Chem. Soc. 131, 13625-13627 (2009).
5. Ramesh, R. & Spaldin, N. A. Nature Mater. 6, 21-29 (2007).
6. Cheong, S.-W. & Mostovoy, M. Nature Mater. 6, 13-20 (2007).
7. Eerenstein, W., Mathur, N. D. & Scott, J. F. Nature 442, 759-765 (2006).
8. Jain, P., Dalal, N. S., Toby, B. H., Kroto, H. W. & Cheetham, A. K. J. Am. Chem. Soc. 130, 10450-10451 (2008).


EVOLUTIONARY BIOLOGY 


Arrhythmia of tempo and mode 


Paul B. Rainey 


An exercise in experimental evolution using bacteria has been running for 
more than 20 years and 40,000 generations. The results to date provide a 
glimpse of a new world, and are cause for both delight and unease.


In his seminal book Tempo and Mode in 
Evolution¹, palaeontologist George Gaylord
Simpson argued for the value of distinguish- 
ing between the tempo of evolutionary change 
(the rate) and its mode (the process); more- 
over, he argued that tempo could be used to 
infer mode. Simpson's primary interest was 
the large-scale variations in rate and pattern 
evident in the fossil record. But his thesis has 
been far-reaching, as influential to students of 
organismal evolution² as to those interested in
the evolution of molecules³. Indeed, despite
numerous uncertainties, molecular evolu- 
tionists use knowledge of the tempo of DNA- 
sequence evolution at specific loci to infer the 
mode of organismal evolution. On page 1243
of this issue, a group led by Richard Lenski,
Barrick et al.⁴, reports the results of work
that provides direct insight into the rate of 
genomic change and organismal adaptation 
over 40,000 generations of evolution. 

Imagine travelling back through evolution- 
ary time, capturing at specific time intervals a 
complete record of the genome from a single 
evolving lineage, and recording all mutational 
events and the magnitude of their effects. Imag- 
ine that, on completion, one has a picture of 
not just changes in physical traits (phenotype), 
but also the underlying dynamic of genomic 
evolution. Barrick et al.⁴ do precisely this. In
1988, Lenski took a single archived clone of the
bacterium Escherichia coli B (clone REL606,
otherwise known as the ancestor), and placed
it in a simple glucose-limiting environment 
where it has existed ever since, being kept in a 
continuous state of growth by daily transfer to 
fresh medium⁵. At regular intervals, samples of
derived populations have been collected, the 
reproductive success (fitness) of these types 
has been determined relative to the ancestor, 
and samples have been cryogenically stored for 
future reference. 

Using ‘next-generation’ DNA-sequencing 
technologies, Barrick et al. determined the 
entire nucleotide sequence of the bacterium’s 
single chromosome from individual clones 
taken at six different time points — from one 
of 12 replicate populations — over the 20 years 
of evolution. By comparison with the ancestor, 
the mutations underlying 40,000 generations 
of evolution are revealed. 

Because clones were sequenced at regu- 
lar intervals, the rate of genomic evolution 
becomes apparent: over the first 20,000 gen- 
erations, mutations accumulated at the rate of 
about 2 per 1,000 generations. This clock-like 
tempo is strongly suggestive of mode. Indeed, 
it is indicative of a neutral mode of evolution 
— evolution driven not by natural selection, 
but by random sampling of selectively neutral 
mutants⁶. According to the neutral theory⁶,
mutations are expected to become stand- 
ard (fixed) in populations at a constant pace. 
That pace is determined by the rate at which 
new mutations arise spontaneously in indi- 
vidual cells — a rate ultimately determined 
by the error rate of enzymes involved in DNA 
metabolism. 

But this makes little sense, because the 
authors’ measures⁴ of organismal adaptation
show overwhelming evidence of natural
selection. In fact, during the course of the first 
20,000 generations, the reproductive success 
of REL606 descendants improved dramati- 
cally. Moreover, the increase was strongly 
nonlinear. During the first 2,000 generations, 
fitness increased 1.5-fold relative to ancestral 
genotype — thereafter, the rate of improve- 
ment decreased (Fig. 1). 

These discordant facts leave us in an uncom- 
fortable position: the clock-like tempo of 
genomic evolution suggests a neutral mode 
of organismal evolution, but the tempo of 
organismal evolution bears the hallmarks 
of evolution by natural selection. Fortunately, 
evolution experiments with microbes offer 
opportunities to delve into mechanistic detail, 
and Barrick et al. do just that. They present 
compelling evidence that the majority of muta- 
tions arising over the first 20,000 generations of 
evolution are beneficial and that their fixation 
is due to natural selection. 

In rejecting a neutral explanation for the 
mutations, Barrick et al. give life to numerous 
questions. The relationship between genomic 
and organismal (adaptive) evolution is without 
doubt counter-intuitive (Fig. 1). From an empirical
perspective, there is a need to know whether
evolution in this single replicate population
is typical of the 11 other replicate lines. If it is
typical, then one is left wondering whether an
explanation will lie in the ecology of competing
beneficial mutations, in subtle increases in
mutation rate, or in the underlying details of
the genotype-to-phenotype map. Nonetheless,
the facts stand: this first glimpse into a
new world will never, despite the excitement,
be fully satisfactory.

Figure 1 | Discordance in the coupling between
genomic and adaptive evolution. In this
summary of the tempo of genomic change
(number of mutations) and organismal
adaptation (fitness) over 20,000 generations of
Escherichia coli evolution, the clock-like rate
of genomic evolution revealed by Barrick and
colleagues⁴ is difficult to understand given
that the rate of fitness improvement decreases
with time. In the absence of direct insight into
genomic evolution, one would predict the rate
of genomic evolution to have declined due to
a reduction in the rate of appearance of new
beneficial mutations, or a reduction in the
average benefit of each mutation (or both).

The astute reader will recognize that there 
has thus far been no comment on the last 20,000 
generations of evolution. This is intentional. 
After generation 20,000, a mutation arose in 
mutT — a gene encoding a protein involved in 
DNA repair — which caused the mutation rate 
to increase; the increase in mutation rate radi- 
cally altered the tempo and mode of genomic 
evolution, although it seems to have had rela- 
tively little impact on organismal evolution. 
The mutations arising during the first period of 
evolution numbered 45 in total: over the ensu- 
ing 20,000 generations, around 600 additional 
mutations became fixed. The footprint left by 
these mutations has the signature of neutral 
evolution, evident primarily by the fact that 
many mutations had no impact on the amino- 
acid sequence of the encoded proteins. 
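
The contrast between the two periods can be made concrete with a few lines of arithmetic. The sketch below uses only the counts quoted in the text (45 mutations in the first 20,000 generations, around 600 in the next 20,000); the variable and function names are illustrative, and the figure of 600 is the article's approximate value rather than an exact count.

```python
GENERATIONS_PER_PERIOD = 20_000

# Counts quoted in the text; the second is described only as "around 600".
MUTATIONS_FIRST_PERIOD = 45
MUTATIONS_SECOND_PERIOD = 600


def rate_per_thousand_generations(mutations, generations):
    """Average number of fixed mutations per 1,000 generations."""
    return mutations / generations * 1000


early_rate = rate_per_thousand_generations(MUTATIONS_FIRST_PERIOD, GENERATIONS_PER_PERIOD)
late_rate = rate_per_thousand_generations(MUTATIONS_SECOND_PERIOD, GENERATIONS_PER_PERIOD)
fold_change = late_rate / early_rate

print(f"First 20,000 generations: {early_rate:.2f} mutations per 1,000 generations")
print(f"Next 20,000 generations:  {late_rate:.2f} mutations per 1,000 generations")
print(f"Fold increase after the mutT mutation: {fold_change:.1f}")
```

The early rate comes out slightly above the "about 2 per 1,000 generations" quoted earlier, and the later rate is roughly an order of magnitude higher.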

The complexity of the relationship between 
tempo and mode of evolution at the genomic 
and organismal levels is the cause of some 
unease, and suggests that caution needs to be 
exercised in inferring mode of organismal evo- 
lution from rates of evolution evident in DNA. 
Simpson, however, would undoubtedly have 
relished the delight that direct knowledge of 
evolution on this scale brings. It is quite won- 
derful to see first-hand the chaos generated by 
the mutT mutation. Such insight shows just how
powerful science, and particularly the science
of experimental evolution, can be, when, as
Barrick et al.⁴ reveal, the question asked finds
resonance with the technical assay.
Paul B. Rainey is at the New Zealand Institute for
Advanced Study, and the Allan Wilson Centre
for Molecular Ecology and Evolution, Massey
University, Auckland, New Zealand.
e-mail: p.b.rainey@massey.ac.nz


1. Simpson, G. G. Tempo and Mode in Evolution (Columbia Univ. Press, 1944).
2. Gould, S. J. The Structure of Evolutionary Theory (Belknap, 2002).
3. Nei, M. Molecular Evolutionary Genetics (Columbia Univ. Press, 1987).
4. Barrick, J. E. et al. Nature 461, 1243-1247 (2009).
5. Lenski, R. E., Rose, M. R., Simpson, S. C. & Tadler, S. C. Am. Nat. 138, 1315-1341 (1991).
6. Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge Univ. Press, 1983).


ASTROPHYSICS 


Most distant cosmic blast seen 


Bing Zhang 


The most distant γ-ray burst yet sighted is the earliest astronomical object
ever observed in cosmic history. This ancient beacon offers a glimpse of
the little-known cosmic dark ages.


In this issue, two papers¹,² report the discovery
of a γ-ray burst (GRB) at a redshift of about
8.2. This is the highest redshift — or equiva- 
lently the most distant — astronomical object 
ever detected in the Universe. For comparison, 
the highest redshift recorded so far for galaxies 
is about 6.96 (ref. 3). For quasars — extremely 
bright galactic nuclei powered by supermassive 
black holes — the record holder is an object at 
6.48 (ref. 4). 

Tanvir et al.¹ (page 1254) and Salvaterra et al.²
(page 1258) measured a concordant redshift
for GRB 090423 on the basis of observations of
its fading afterglow emission at different times
after the initial burst of γ-rays. In astronomy, a
larger distance (or redshift) corresponds to an 
earlier time in cosmic history, because it takes 
longer for a farther-away photon, which trav- 
els at finite speed, to reach Earth. Moreover, 
because the Universe is expanding, the wave- 
length of electromagnetic radiation emitted by 
a distant object is stretched to be longer and 
redder (redshifted) on its course to Earth. The 
farther away the source, the more the Universe 
has expanded, and therefore the higher the 
source’s redshift. The redshift measured for 
GRB 090423 means that the burst occurred at 
a time when the Universe was about nine times
smaller than it is today — putting the timing 
of the event at about 630 million years after the 
Big Bang. 
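
The two numbers in this paragraph can be checked with a short calculation. The sketch below converts redshift to the relative size of the Universe, 1/(1 + z), and estimates the cosmic age at that redshift from the standard closed-form expression for a flat universe containing only matter and a cosmological constant. The cosmological parameters (a Hubble constant of 71 km/s/Mpc and a matter density of 0.27) are assumptions typical of the period and are not stated in the article; the function names are likewise illustrative.

```python
import math

# 1/H0 expressed in gigayears when H0 is given in km/s/Mpc.
HUBBLE_TIME_GYR_TIMES_H0 = 977.8


def scale_factor(z):
    """Size of the Universe at redshift z relative to today."""
    return 1.0 / (1.0 + z)


def age_flat_lcdm_gyr(z, h0=71.0, omega_m=0.27):
    """Age of a flat matter-plus-Lambda universe at redshift z, in Gyr.

    h0 and omega_m are assumed values, not taken from the article.
    """
    omega_lambda = 1.0 - omega_m
    hubble_time_gyr = HUBBLE_TIME_GYR_TIMES_H0 / h0
    x = math.sqrt(omega_lambda / omega_m) * (1.0 + z) ** -1.5
    return 2.0 / (3.0 * math.sqrt(omega_lambda)) * hubble_time_gyr * math.asinh(x)


z_grb = 8.2
print(f"Universe smaller by a factor of {1.0 / scale_factor(z_grb):.1f}")
print(f"Age at z = {z_grb}: {age_flat_lcdm_gyr(z_grb) * 1000:.0f} million years")
print(f"Age today: {age_flat_lcdm_gyr(0.0):.1f} billion years")
```

Under these assumed parameters the result is close to the 630 million years quoted above; other reasonable parameter choices shift the figure by a few per cent.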

GRBs are the most violent explosions in the 
Universe. They are believed to be associated 
with the formation of stellar-sized black holes 
or rapidly rotating, highly magnetized neutron 
stars during cataclysmic events such as the 
collapse of a massive star or the coalescence of 
two compact stellar objects. The reason why 
GRBs can outshine galaxies and quasars, which 
are much more massive, to become the redshift 
record holders (Fig. 1) is threefold. 

First, thanks to their high speed (99.9995%
of the speed of light), their maximum luminosity
is extremely high, dwarfing that of galaxies
and quasars by many orders of magnitude.

Second, whereas galaxies and quasars look 
progressively dimmer at higher redshifts (both 
because of a larger distance from Earth and 
an intrinsic smaller mass at an earlier cosmic 
time), the apparent brightness of GRBs and 
their infrared afterglows (which hold the key 
to redshift identification) do not decrease sig- 
nificantly with increasing redshift. This is due 
to a concerted combination of two effects, the 
k-correction and the time-dilation effect’. 

Finally, whereas bright galaxies and quasars 
become rarer as redshifts rise above 7, theoreti- 
cal models suggest that massive stars, thought 
to be the progenitors of high-redshift GRBs, 
can form much earlier®. As a result, GRBs can 
be more easily detected at higher redshifts, and 
may hold the key to illuminating the cosmic 
‘dark ages’ (Fig. 2, overleaf). 

Although satellites such as Swift⁷ have been
successful in detecting high-redshift GRBs,
characterizing these objects and measuring
their redshifts is a different matter. To do so
requires catching their rapidly fading afterglows
and identifying their spectral signatures.
The main signature is a fall in emissions,
caused by intervening hydrogen-gas clouds
along the light path, bluewards of the Lyman-α
resonance line of hydrogen. For high-redshift
GRBs, this feature is best observed with near-infrared
observations, because the original
ultraviolet light of such GRBs is redshifted into
the observable near-infrared window.

Figure 1 | Redshift ladder. Timeline of redshift
record-breaking for three classes of astronomical
object: galaxies (red), quasars (green) and γ-ray
bursts (blue). The measurement of γ-ray-burst
redshift began much later than that of galaxies
and quasars, but since the first discovery
it has seen a much faster pace of redshift
record-breaking. (Courtesy of N. Tanvir.)
[Plot: record redshift against year, 1960 to 2010.]

Figure 2 | The cosmic dark ages and GRB 090423. After the Big Bang, the Universe cools rapidly while
expanding. About 400,000 years after this event, free electrons and protons combine to form neutral
atoms, leaving a bath of background radiation that currently shines in the microwave part of the
electromagnetic spectrum. Thereafter, the Universe remains neutral, until the first stars and galaxies
light up at a later epoch. Photons emitted by these objects knock electrons out of atoms and ‘re-ionize’
the Universe. Studies of the most distant galaxies and quasars suggest that the re-ionization process was
completed around 800 million to 900 million years after the Big Bang, but no information is available
about the cosmic ‘dark ages’. Observations of γ-ray bursts such as GRB 090423 (refs 1, 2), which occurred
about 630 million years after the Big Bang, offer a glimpse of the cosmic dark ages. (Adapted from ref. 15.)
[Timeline: Big Bang; atoms form, 400,000 years; GRB 090423, 630 million years; re-ionization ends,
800-900 million years; now, 13.7 billion years.]

For GRB 090423, Tanvir et al.¹ observed
the near-infrared afterglow, starting about
17.5 hours after the burst, using the European
Southern Observatory 8.2-metre Very Large
Telescope in Chile. Meanwhile, Salvaterra
et al.² observed the burst from about 14 hours
after it occurred using the 3.6-metre Italian
Telescopio Nazionale Galileo in Spain. Both
teams discovered a clear break in emissions
at wavelengths shorter than about 1.1 micrometres,
which corresponds to the Lyman-α line
frequency at the derived redshift.
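
The link between the observed break and the redshift is a one-line relation: the observed wavelength is the rest wavelength multiplied by (1 + z). The sketch below applies it in both directions. The rest wavelength of the Lyman-α line (121.567 nm) is a standard laboratory value that the article does not quote, and the function names are illustrative.

```python
# Rest-frame wavelength of hydrogen Lyman-alpha in nanometres
# (standard value, not given in the article).
LYMAN_ALPHA_REST_NM = 121.567


def observed_wavelength_um(z, rest_nm=LYMAN_ALPHA_REST_NM):
    """Observed wavelength in micrometres of a line emitted at redshift z."""
    return rest_nm * (1.0 + z) / 1000.0


def redshift_from_break(observed_um, rest_nm=LYMAN_ALPHA_REST_NM):
    """Redshift implied by a Lyman-alpha break seen at observed_um micrometres."""
    return observed_um * 1000.0 / rest_nm - 1.0


print(f"Lyman-alpha at z = 8.2 is observed at {observed_wavelength_um(8.2):.3f} micrometres")
print(f"A break at 1.12 micrometres implies z = {redshift_from_break(1.12):.2f}")
```

This places the break just above 1.1 micrometres for a redshift of 8.2, in the near-infrared, consistent with the observations described above.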

The authors’ discovery¹,² opens up the
exciting possibility of studying the cosmic
‘re-ionization’ epoch, and the preceding dark
ages, using GRBs. In its spectrum, particularly
in the absorption ‘damping wing’, a GRB carries
information about the fraction of neutral gas
in the intergalactic medium (IGM)⁸, and so the
IGM re-ionization state, at the cosmic time at 
which it occurred. Studying a number of GRBs 
spread over a range of redshifts would allow 
one to map out the course of the re-ionization 
process throughout cosmic time. 

Realistically, two factors seem likely to hinder 
this prospect. First, afterglow observations 
suggest that there is a large amount of neutral 
gas in the interstellar medium (ISM) surround- 
ing GRBs. In some cases, the contribution of 
the ISM to GRB light-absorption dominates 
over that of IGM absorption, making it dif- 
ficult to extract information about cosmic 
re-ionization from the data®. But cosmologi- 
cal simulations suggest that, at high redshifts, 
this ISM effect should decrease with increasing 
redshift’, a trend that seems to be supported 
by observations of the second-highest-redshift 
burst, GRB 080913 (ref. 10). More high-redshift 
GRB data are needed to verify or disprove this 
prediction. 

Second, afterglows fade rapidly with time. 
High-resolution spectroscopy is therefore 
needed at early (post-burst) epochs to catch 
them when they are still bright. In the case of 
GRB 090423, high-resolution spectra’ were 
taken from about 14 hours after the burst, 
when the afterglow had already faded consid- 
erably. The spectra presented by Tanvir et al. 
and Salvaterra et al.² can therefore only serve
the purpose of redshift identification. To make
further progress in studying the high-redshift 
Universe with GRBs, prompt alerts on high- 
redshift GRB candidates would be desirable. 
It is hoped that this could be achieved in the 
future by GRB space missions such as SVOM, 
JANUS and EXIST, or by ground-based, large 
near-infrared robotic telescopes. 

That said, GRB 090423 still provides much 
information about the high-redshift Universe 
and the mechanisms that underlie GRB for- 
mation at that cosmic epoch. Salvaterra and 
colleagues² note that the burst detection may
suggest that star-formation rate at high red- 
shift is anomalously high, or that the GRB 
luminosity function (the relative number of 
GRBs that have a certain luminosity) evolves 
with redshift. Tanvir and colleagues¹, however,
point out that its detection is consistent with 
the high-redshift star-formation rate predicted 
by some theoretical models’’. More data are 
needed to solve the authors’ discrepant inter- 
pretations of the detection of GRB 090423. 

Moreover, both studies¹,² report that the
X-ray and near-infrared afterglows of the burst 
are not very different from those of nearer 
GRBs, suggesting that progenitor stars simi- 
lar to those of their more recent counterparts 
already existed at cosmic times as early as 
630 million years after the Big Bang. Detec- 
tion of radio afterglow and afterglow model- 
ling” suggest that the circumburst medium 
of GRB 090423 is also similar to that of its 
nearby cousins. Taken together, all of these 
observations'”” indicate that the progenitor 
of GRB 090423 is not one of the first-genera- 
tion stars, which, unlike their present-day ana- 
logues, are believed to be much more massive 
and metal poor (containing only hydrogen and 
helium)’. 

Finally, the (rest-frame) duration of 
GRB 090423 is slightly longer than 1 second. 
Given its redshift, this is an unexpected prop- 
erty, but one that is shared by GRB 080913, 
and that implies that the burst may fall into 
the short-lived category. However, several 
arguments’* suggest that both GRB 090423 
and GRB 080913 are related to the collapse of 
massive stars, rather than the merging of com- 
pact objects that is believed to power nearby, 
short-lived GRBs“. 

The apparently short durations of these 
two bursts may be due to an observational- 
selection effect. A more intriguing possibility, 
which future observations could test, is that 
this is related to the properties of GRB pro- 
genitors at increasingly higher redshifts, which 
seem to produce intrinsically shorter bursts. 
In any case, thus far, the indication is that GRB 
duration alone is no longer necessarily the 
crucial criterion to discern the physical nature 
of GRBs.

CATALYSIS


Bond control in surface reactions 


Jens K. Norskov and Frank Abild-Pedersen 


Catalysts steer reactions towards certain products — but the basis of 
their control is often unclear. Quantum chemical calculations reveal which 
parameters control bond formation in a network of catalytic reactions. 


Control of bond formation in reactions is 
the key to making chemical production 
more energy-efficient and product-specific 
— thereby minimizing the formation of 
unwanted side products. It is also crucial for 
developing sustainable industrial processes 
for manufacturing fuels. In large-scale chemical
processes, such ‘bond control’ is typically
exerted by the surfaces of catalysts that consist 
of porous solids, or of nanoparticles supported 
on other materials. Writing in Angewandte 
Chemie, Loffreda et al.¹ report an essential
step towards understanding the order in which 
chemical bonds form in molecules bound to a 
catalyst’s surface. They consider a prototype 
reaction in which unsaturated aldehydes react 
with hydrogen on a platinum surface (Fig. 1), 
and show that quantum chemical calculations 
can provide a simple set of rules to determine 
which bond is hydrogenated first. 

Quantum chemical methods for describing 
surface reactions have developed extensively 
during the past decade, and have now reached 
the point at which complete catalytic reac- 
tions can be described in some detail. Indeed, 
the first examples in which such insight has 
been used to design new catalysts have been 
reported” *. Currently, however, only the sim- 
plest reactions can be modelled in full using 
quantum calculations. For complex reaction 
networks, a more promising approach is to 
develop an understanding of which factors 
within a network determine the overall rates. 
For questions relating to reaction selectivity — 
that is, which bond reacts first — the challenge 
is to identify the descriptors that determine the 
relative rates of all of the possible pathways in 
which different bonds react. These descrip- 
tors can then be calculated or measured in the 
search for new catalysts. 

The overall reactivity of a given catalyst 
is traditionally understood in terms of the 
Sabatier principle’, which states that the rate of 
a reaction is maximized when the interactions 
between the intermediates and the catalyst 


are neither too weak nor too strong. Weak 
bonding makes the catalyst too inert to allow 
the reaction to take place, whereas overly 
strong bonding may cause the catalyst to hold 
on to intermediates, hampering the formation 


+H, 
u 
Propanal 
+H, 
Lv 
Catalyst Propenol 
& +H, . 
Acrolein 
(E)-1-Propenol 
+2H, 


Ge 


Propanol 


Figure 1| Selective hydrogenation of the 
unsaturated aldehyde acrolein. Acrolein can, in 
principle, react with hydrogen in the presence of 
a metal catalyst to yield four possible products: 
propanal, a saturated aldehyde; propenol, an 
unsaturated alcohol; (E)-1-propenol, an enol; 
and the fully hydrogenated compound propanol, 
a saturated alcohol. Loffreda et al.' find that the 
rates of each reaction are governed entirely by the 
strength with which the respective intermediates 
bind to the surface of the catalyst. This parameter 
can thus be used to predict which bonds of 
unsaturated aldehydes will react in catalytic 
hydrogenations. Carbon atoms, grey; oxygen 
atoms, red; hydrogen atoms, white. 
of products. The energies of bonds formed 
between the main reaction intermediates and 
catalytic surfaces therefore determine the overall 
catalytic rate, and can be used as descriptors 
of that rate’. Only a few bond energies are 
usually needed to determine a catalyst’s reactivity, 
because the various parameters associated 
with reactivity tend to scale with each other, 
effectively limiting the number of independent 
variables””. Specifically, the activation energies 
of reactions (the energies required to initiate 
reactions) generally scale with the energy 
difference between reaction intermediates, 
and surface-bond energies for different inter- 
mediates in a reaction pathway often scale with 
each other. 

Loffreda et al.’ report that specific surface- 
bond energies also work as descriptors for reac- 
tion selectivity. They first show that the order 
in which the different double bonds of acrolein 
(a simple unsaturated aldehyde, C3H4O) are 
hydrogenated on the surface of a metal such 
as platinum can be understood by dividing all 
of the possible reactions that could occur into 
four groups. Each group is defined by its reac- 
tion centre — one of the three carbon atoms, 
or the oxygen atom. The authors find that, 
for each group and at any given point along a 
reaction pathway, the transition-state energy 
(the energy at which the complex of reactants 
becomes more product-like than reactant-like) 
scales linearly with the surface-bond energy of 
the preceding intermediate–metal complex. In 
fact, there is an almost one-to-one correlation 
between the energies. This means that the acti- 
vation energies for all of the different reactions 
that can take place at each centre are about the 
same, something that has also been found in 
studies of smaller molecules reacting on a 
variety of transition metals*. 
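
To make the consequence of a one-to-one scaling concrete, here is a minimal sketch. It assumes a linear relation E_TS = slope × E_bind + offset; the function names, the offset and the binding energies are hypothetical illustrations, not values from Loffreda et al. With a slope of 1, the activation energy E_TS − E_bind reduces to the offset, whatever the binding energy.

```python
def transition_state_energy(e_bind, slope=1.0, offset=0.8):
    """Assumed linear scaling relation: E_TS = slope * E_bind + offset (eV)."""
    return slope * e_bind + offset


def activation_energy(e_bind, slope=1.0, offset=0.8):
    """Barrier measured from the preceding surface-bound intermediate (eV)."""
    return transition_state_energy(e_bind, slope, offset) - e_bind


# Hypothetical surface-bond energies (eV) for intermediates at one reaction centre
binding_energies = [-1.2, -0.9, -0.5]
barriers = [activation_energy(e) for e in binding_energies]

# With a slope below 1 the barrier would instead depend on the binding energy
barriers_weaker_scaling = [activation_energy(e, slope=0.7) for e in binding_energies]
```

With slope = 1 every entry of `barriers` equals the offset, which is the sense in which the activation energies at a given centre are "about the same".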

Intriguingly, Loffreda et al.’ go on to show 
that the same scaling relationship also applies 
to another unsaturated aldehyde, prenal. This 
means that the transition-state energies for pre- 
nal hydrogenation could have been predicted 
by calculating the surface-bond energies for 
the relevant reaction centres of that molecule. 
For any given reaction that has several possible 
reaction pathways, the one that has the lowest 
transition-state energy typically has the fast- 
est rate — it will occur before the others. The 
authors have therefore shown that surface- 
bond energy is a descriptor of reaction selec- 
tivity. If this is a general result, then it will allow 
the reaction selectivities for hydrogenations of 
many aldehydes on many different catalysts to 
be screened rapidly using quantum chemical 
calculations. Hydrogenations are exception- 
ally useful reactions in industry, so this is an 
exciting prospect. 
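
The selection rule described here can be sketched in a few lines. The transition-state energies below are hypothetical placeholders for the four reaction centres of acrolein, and the rate ratio assumes equal pre-exponential factors and a common reference state, which is a simplification rather than the authors' procedure.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def predict_first_pathway(ts_energies):
    """Return the label of the pathway with the lowest transition-state energy."""
    return min(ts_energies, key=ts_energies.get)


def relative_rate(e_ts_a, e_ts_b, temperature_k=300.0):
    """Arrhenius-type ratio rate_a / rate_b, assuming equal prefactors."""
    return math.exp(-(e_ts_a - e_ts_b) / (K_B_EV * temperature_k))


# Hypothetical transition-state energies (eV), one per reaction centre
ts_energies = {"C1": 0.95, "C2": 0.80, "C3": 0.85, "O": 0.70}
first = predict_first_pathway(ts_energies)
ratio_vs_c2 = relative_rate(ts_energies[first], ts_energies["C2"])
```

Because the ratio depends exponentially on the energy difference, even a modest gap between two transition states translates into a strong preference for one pathway at room temperature.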

Loffreda and colleagues’ work exemplifies 
the great progress that has been made in our 
understanding of solid-state catalysts in recent 
years as a direct result of quantum chemical cal- 
culations. Extending the use of computational 
techniques to complex reaction networks is the 
next great challenge — but there are already 
promising signs that this can be done, and 
that these techniques will become an essential 
tool for catalyst design. To fulfil this promise, 
improvements to the accuracy of quantum 
chemical calculations are needed, as well as 
faster computers and algorithms to enable 
calculations for systems of greater complexity. 
But perhaps more critically, as Loffreda et al.' 
show, we need the insight provided by compu- 
tational techniques to distinguish between the 
important and the less important parameters 
of catalysis. 
STRUCTURAL BIOLOGY 


DNA binding shapes up 


Tom Tullius 


DNA-binding proteins have the daunting task of finding their binding sites 
among the 3 billion base pairs of the human genome. The shape of DNA, 
and not just its sequence, may offer proteins much-needed direction. 


The genetic information embodied in DNA 
must be decoded at the right time and in the 
right type of cell. To achieve this, proteins that 
control such processes have to bind to specific 
places in the genome. How a protein finds the 
correct spot to bind to among all the possi- 
ble sites (3 billion base pairs in the human 
genome, for example) has been the preoccu- 
pation of molecular and structural biologists 
for decades. 

Watson and Crick taught us that DNA adopts 
the form of a double helix, and much of DNA’s 
biological function is evident from the complementary 
strands that make up this iconic 
helix. These days, we tend to think of DNA as a 
string of letters (A, G, C and T), which stand for 
the bases of the four nucleotides that make up 
the DNA polymer. The familiar genetic code 
consists of sets of three DNA nucleotides that 
specify one of the 20 amino acids that make up 
proteins; the deciphering of this code was one 
of the triumphs of the early days of molecular 
biology. 

One might imagine that a protein could 
recognize its binding site in the genome by 
somehow ‘reading’ these letters, and in fact 
this has been found to be the case from study- 
ing structures of protein-DNA complexes. The 
DNA double helix has two grooves, a major and 
a minor one, that wind around the central axis 
of the molecule, and reading is achieved using 
hydrogen bonds that form between protein 
side chains and the edges of the DNA nucleo- 
tides that are exposed in the major groove. 
But unlike the genetic code, a simple code for 
protein recognition of DNA has not emerged 
despite years of effort. 

Perhaps we lose something in the simplification 
of DNA to a one-dimensional string of letters. 
We may forget that DNA is a molecule with a 
three-dimensional shape that is not perfectly 
uniform. Rohs et al.’ (page 1248 
of this issue) now find that one structural 
feature of DNA, the shape of its minor groove, 
can be exploited by proteins for specific 
recognition. 

A particular sequence of nucleotide letters 
presents a unique array of hydrogen-bond 
donors and acceptors in the major groove, 
providing a clear mechanism for reading that 
sequence’. The minor groove, though, has been 
inscrutable, as a simple code of hydrogen-bond 
donors and acceptors that specify a nucleotide 
sequence is not present in this groove. 

The minor groove has another trick up its 
sleeve — its width varies depending on which 
nucleotides are present in a segment of DNA. 
And the width of the minor groove has a physi- 
cal consequence that goes beyond the merely 
structural, stemming from the charged groups 
(phosphates) that are arrayed along the outer 
edge of the DNA backbone, one per nucleotide 
(Fig. 1a). Where the minor groove is narrow, the 
electric-field lines due to the negatively charged 
phosphates are focused into the groove, leading 
to an enhanced negative electrostatic potential 
in that segment of the double helix. 

Rohs and colleagues’ exhaustively searched 
the databases of three-dimensional structures 
of protein-DNA complexes and found many 
examples of proteins that use amino acids 
containing positively charged side chains, 
principally arginines, to read the electrostatic 
potential of the minor groove. Where the groove 
width and electrostatic potential are optimal, 
an arginine side chain of a DNA-binding pro- 
tein is often seen to sit snugly in the minor 
groove (Fig. 1b). It is important to appreciate 
Figure 1| Getting into the groove. Rohs et al.' 
report that the shape of the minor groove of 
DNA can direct the binding of proteins to 
specific sites. a, Negatively charged phosphate 
groups (magenta) are arrayed along the outer 
edge of the DNA major and minor grooves that 
spiral around the axis of the double helix. The 
width of the minor groove varies depending 
on the sequence of nucleotides. This variation 
leads to differences in the distance between 
phosphates across the groove, which in turn lead 
to variation in the negative electrostatic potential 
along the minor groove. b, A representation 
of a DNA-binding protein (green) that has a 
positively charged side chain on its surface, 
for example arginine (purple), is shown. The 
protein may recognize a binding site on DNA 
by its electrostatic potential. The protein is 
about to bind to the segment of the DNA minor 
groove that has the optimum groove width 
and electrostatic potential for binding (red). 
The DNA structure in a and b is derived from 
a structure in the RCSB Protein Data Bank, 
accession number 2061. The illustration of a 
DNA-binding protein in b is hypothetical. 


that this mechanism of DNA recognition is 
distinct from the direct readout of hydrogen- 
bond donors and acceptors in the major 
groove, because different strings of nucleotides 
can adopt similar minor-groove shapes. 

For this to be a means of recognizing infor- 
mation encoded in the DNA sequence, there 
must be particular nucleotide sequences that 
result in a narrow minor groove. Again, by 
searching the structural databases, Rohs et al.' 
found that short runs of adenine nucleotides 
(called A-tracts) have a strong tendency to 
induce a narrow minor groove. The special 
structural properties of A-tracts have been 
known for three decades, and have been princi- 
pally associated with DNA-sequence-directed 
curvature’. Curvature of DNA requires a series 
of short A-tracts, spaced at the DNA helix 
repeat of 10 base pairs. What is new in Rohs and 
colleagues’ analysis’ is their demonstration that 
an isolated A-tract can influence DNA shape in 
such a way that it can serve as a site for specific 
recognition by DNA-binding proteins. 
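
As a rough illustration of how candidate sites of this kind could be located in a sequence, the sketch below scans for runs of consecutive adenines. The function name and the minimum run length of four are assumptions made for the example; the exact A-tract definition used by Rohs et al. is not given here and may differ, and runs of thymine on the same strand (adenines on the complementary strand) are not considered.

```python
import re


def find_a_tracts(sequence, min_length=4):
    """Return (start, end) pairs, end exclusive, for runs of >= min_length adenines.

    min_length is an illustrative threshold, not a value taken from the study.
    """
    pattern = re.compile(r"A{%d,}" % min_length)
    return [(m.start(), m.end()) for m in pattern.finditer(sequence.upper())]


# Made-up example sequence containing a 5-base run, a 2-base run and a 4-base run
example = "GCGAAAAACGTTAAGCAAAATG"
tracts = find_a_tracts(example)
```

A scan like this only flags sequence candidates; whether a given run actually narrows the minor groove is a structural question that the sequence alone does not settle.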
It’s not only sequence-specific DNA-binding 
proteins that have figured out how to read the 
electrostatic potential of the minor groove. 
Rohs et al.’ show that histone octamers, the 
protein complexes around which DNA wraps 
to form nucleosomes, also exploit this mecha- 
nism. By comparing nucleotide sequences of 
DNA segments known to form nucleosomes, 
they show that short A-tracts have a tendency 
to be periodically spaced throughout such 
sequences. They also find, from X-ray struc- 
tures of nucleosomes, that arginines are often 
present in the minor groove where it narrows 
as a consequence of wrapping the DNA double 
helix around the histone octamer. 

The ability to sense the variation in electrostatic 
potential in DNA may reveal how a 
protein could home in on its binding site in the 
genome without touching every nucleotide, as 
electrostatics is a through-space phenomenon. 


A difficulty of applying the analysis presented in 
this study is that it depends on high-resolution 
three-dimensional structures of protein-DNA 
complexes — that’s the necessary input to the 
Poisson-Boltzmann equation that was used by 
Rohs et al.' to calculate the electrostatic poten- 
tial of the minor groove. If we could develop 
an experimental measure of the minor-groove 
potential and how it varies with sequence, it 
would be possible to read the human genome 
like a protein does, treating DNA as a molecule 
and not just a string of letters. 
Tom Tullius is in the Department of Chemistry 
STATISTICAL PHYSICS 


Swirled by light 


Mark I. Dykman 


A micrometre-sized particle immersed in a liquid can be trapped by light. 
An experiment shows that the trapping can be accompanied by a whirling 
whose direction can be reversed by changing the light intensity. 


The past few decades have witnessed remarkable 
progress in the control and manipulation of tiny 
objects, including micrometre-sized particles, 
bacteria and DNA molecules. These advances 
are associated with the development of opti- 
cal tweezers'” — instruments that use strongly 
focused laser light to trap objects and move 
them at will. However, the stiffness of optical 
tweezers is limited, and an optically trapped 
particle does not reside at the trap centre 
but rather wanders about it. Such wandering 
is a consequence of thermal fluctuations in the 
particle's surrounding liquid, and is a counter- 
part of the random motion experienced by 
free-floating particles — the Brownian motion 
explained by Albert Einstein in 1905. 
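
The thermal wandering of a trapped particle can be sketched with a one-dimensional toy model: overdamped Brownian motion in a harmonic trap, advanced with the exact update for this process. All parameters are dimensionless and illustrative, and the function name is an assumption. The model contains only a conservative trapping force in thermal equilibrium, so it reproduces the spread of positions about the trap centre and nothing beyond that.

```python
import math
import random


def simulate_trapped_particle(stiffness, kT, relaxation_time, dt, n_steps, seed=0):
    """Overdamped Brownian motion in a 1D harmonic trap (exact discrete update)."""
    rng = random.Random(seed)
    variance_eq = kT / stiffness             # equipartition: <x^2> = kT / stiffness
    decay = math.exp(-dt / relaxation_time)  # relaxation towards the trap centre
    kick = math.sqrt(variance_eq * (1.0 - decay * decay))  # thermal noise per step
    x = 0.0
    positions = []
    for _ in range(n_steps):
        x = x * decay + kick * rng.gauss(0.0, 1.0)
        positions.append(x)
    return positions


# Illustrative, dimensionless parameters (not taken from any experiment)
positions = simulate_trapped_particle(
    stiffness=2.0, kT=1.0, relaxation_time=1.0, dt=1.0, n_steps=50000
)
mean_square = sum(x * x for x in positions) / len(positions)
```

The mean-square displacement settles near kT divided by the stiffness, which is why a trap of limited stiffness cannot pin the particle at its centre.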

Writing in Physical Review E, Sun et al.* 
report that the trajectory of an optically trapped 
colloidal particle in water, although random, 
can display a complicated but identifiable 
pattern. The particle’s motion is characterized 
by a circulatory bias — the streamlines that 
are obtained by averaging over an ensemble of 
trajectories exhibit circulation. The authors 
associate this circulation with a toroidal 
particle current and call it a Brownian vortex. 
Interestingly, they find that the overall 
streamline pattern and the direction of the 
circulation depend on the intensity of the 
laser light used to trap the particle, with the 
possible coexistence of counter-rotating rolls of 
toroidal current. (A spurious inward spiralling 
in the streamline pattern was addressed by 
the authors in an independent study’.) 

Sun and colleagues’ observation’ of a macro- 
scopic flux — in the form of streamline circula- 
tion — is both interesting in itself and a clear 
indication that the particle is not at thermal 
equilibrium. The flux was found in a stationary 
optical trap, in contrast to the directed motion 
of Brownian particles seen previously* in non- 
stationary traps, in which a ratchet-like optical 
potential was periodically modulated in time. 

The absence of macroscopic fluxes is a 
fundamental property of classical systems in 
thermal equilibrium. Such fluxes would dis- 
sipate energy through some form of friction, 
and would decay if there was no energy input 
from the outside. By contrast, non-equilibrium 
systems that gain energy from external sources 
should generally display fluxes. These fluxes 
can be stationary — a property compatible 
with the system as a whole being stationary 
(with no accumulation of particles, energy 
and so on) and with the probability distribu- 
tion of its dynamical variables being independ- 
ent of time. For optically trapped, transparent 
colloidal particles in thermal equilibrium, the 
stationary probability distribution has been 
studied in detail®”. 

Non-equilibrium, flux-carrying systems 
can be divided into two classes. One com- 
prises systems characterized by a stationary 
flux that is sustained by a periodic or constant 
external driving and does not require fluc- 
tuations (thermal or non-thermal) to persist. 
A well-known physical example of such a 
system is a laser: once a laser starts radiating 
in response to external pumping, the primary 
effect of fluctuations is to reduce the coherence 
of the laser radiation — that is, the extent to 
which the emitted laser light waves are in step 
with one another decreases. By contrast, in 
systems belonging to the second class, external 
driving on its own cannot support a station- 
ary flux; the very occurrence of the flux is due 
to fluctuations’. It is into this second category 
that the colloidal-particle system studied by 
Sun and colleagues’ falls. 

An insight into the onset of a fluctuation- 
facilitated flux can be gained by looking at the 
fluctuation dynamics of a system. In the 1950s, 
Onsager and Machlup noticed? that, for a col- 
loidal particle in thermal equilibrium, the most 
likely trajectory from the equilibrium position 
to a given spatial point coincides with the time- 
reversed most likely path the particle would 
follow from that point. Therefore, most prob- 
ably, the particle arrives at a point with a veloc- 
ity opposite to the one with which it leaves the 
point. The implication is that there is no net flux. 
By contrast, the most probable trajectories of 
non-equilibrium systems, such as that studied 
by Sun et al.’, lack time-reversal symmetry, and 
this leads to the onset of a flux. That there is a 
difference between the most probable trajecto- 
ries of a non-equilibrium system to and from a 
given point has been demonstrated both in ana- 
logue electrical circuits’ and experimentally’. 

Sun and colleagues’ observation’ of a 
stationary flux and its intricate structure for 
an optically trapped colloidal particle provides 
insight into the physics of systems away from 
thermal equilibrium, demonstrating features 
in their dynamical behaviour that have no 
analogue in systems in thermal equilibrium. 
The trajectory-based approach is useful for 
gaining such insight because of its sensitivity 
to the dynamics of the system. In the context 
of optical tweezers and their applications, the 
experiment raises questions about the nature 
of the forces to which a particle is subjected in 
an optical trap and the reason the particle is not 
in thermal equilibrium. 
Mark I. Dykman is in the Department of Physics 
and Astronomy, Michigan State University, 

East Lansing, Michigan 48824, USA. 
e-mail: dykman@pa.msu.edu 
Volatile accretion history of the terrestrial 
planets and dynamic implications 


Francis Albarède 


Accretion left the terrestrial planets depleted in volatile components. Here I examine evidence for the hypothesis that the 
Moon and the Earth were essentially dry immediately after the formation of the Moon—by a giant impact on the proto-Earth— 
and only much later gained volatiles through accretion of wet material delivered from beyond the asteroid belt. This view is 
supported by U-Pb and I-Xe chronologies, which show that water delivery peaked ~100 million years after the isolation of the 
Solar System. Introduction of water into the terrestrial mantle triggered plate tectonics, which may have been crucial for the 
emergence of life. This mechanism may also have worked for the young Venus, but seems to have failed for Mars. 


Earth, Mars and Venus are three planets with very different 
histories. Active plate tectonics, the presence of a liquid 
ocean, and teeming life characterize the Earth. Mars has a 
tenuous atmosphere, shows no incontrovertible evidence of 
plate tectonics, lacks an abundant hydrosphere, and has so far evaded 
all attempts to reveal the presence of life. Venus is a dead inferno of 
carbon dioxide with clouds of sulphuric acid, and its solid surface 
seems to have been violently resurfaced some 600-900 million years 
(Myr) ago. At a time when so many satellites and telescopes are 
probing the Universe in search of habitable planets, it is worth ques- 
tioning what made three planets of our Solar System so different from 
each other that only one of them harbours life. 
Figure 1 | Fractionation of Zn isotopes is incompatible with volatilization. The 
data*’ represent the δ66Zn value, the relative deviation of the 66Zn/64Zn ratio in 
per mil with respect to a terrestrial standard, of each sample. Data for terrestrial 
samples are shown in pale grey; data for the different classes of carbonaceous 
chondrites are shown in dark blue (CI), green (CM), red (CV) and orange (CO). 
In the case of volatilization, preferential loss of the light 64Zn isotope in the 
residue is expected to accompany the increase of the Mg/Zn ratio, as magnesium 
(T50 = 1,336 K) is much more refractory than zinc (T50 = 726 K). Because the 
opposite is observed, fractionation processes other than volatilization must be 
sought. T50 is the temperature at which 50% of the element is condensed”. 


When life is the topic, the presence of liquid water immediately 
becomes an issue. The origin of water and its fate have also been central 
concerns of the planetary sciences, because water changes the rheolo- 
gical properties of planetary mantles and seems to be a prerequisite for 
plate tectonics to operate. A commonly held view is that the terrestrial 
ocean results from the outgassing of the Earth’s mantle’. Recent 
dynamic calculations, however, emphasize the hot temperatures next 
to the nascent Sun, and suggest that the planetary embryos from the 
inner Solar System remained essentially dry. It is only late in accretion 
history that perturbations of asteroid orbits in the inner Solar System 
by giant planets started to launch ice-rich material that delivered water 
to the planets in that part of the Solar System. A simplistic but efficient 
description of the debate on planetary water is that of ‘dry’ versus ‘wet’ 
accretion. 

This Review will first show that the depletion of the volatile content 
of the Earth and other terrestrial planets is not due to loss by out- 
gassing but to incomplete stepwise accretion of nebular material; this 
accretion was interrupted by energetic electromagnetic radiation that 
was emitted by the young Sun during its T Tauri phase and swept 
through the disk. The timing of volatile accretion, water in particular, 
with respect to the giant lunar impact, the formation of the terrestrial 
core, and its implications for the contrasting geodynamic regimes of 
the Earth, Mars, and Venus will be discussed. 


Water in the deep Earth 


The Earth contains water in different forms: a liquid ocean at the 
surface, various forms of ground water and, in the mantle, dense 
hydrous mineral phases and water dissolved in nominally anhydrous 
minerals such as olivine. Evidence for how much water the solid 
Earth contains is robust. In the absence of a gaseous phase, that is, 
at depths in excess of about 90 km, water behaves as an incompatible 
element with properties remarkably similar to those of cerium”: the 
H2O/Ce ratio is about 200 in the source of both mid-ocean ridge 
basalts and ocean island basalts, which are widely taken as melts from 
upper- and lower-mantle material, respectively. Given this strong 
constraint and reasonable assumptions on the degrees of melting 
prevailing during the formation of these melts, the amount of water 
held by the mantle is approximately equivalent to an oceanic mass, 
which translates into 150-350 p.p.m. H2O in the mantle and a bulk 
terrestrial water concentration of 300-550 p.p.m. (refs 3, 4). The 
water content of the Archaean mantle is open to interpretation’, 
Figure 2 | Stepwise accretion of the elements on cooling of the solar 
nebula, shown as atomization enthalpy versus T50. The two axes represent 
two different volatility scales, an intrinsic scale given by the atomization 
enthalpy”, which is essentially equivalent to the mean bonding energy per 
atom in solids, and a scale dependent on the nebular chemistry, T50 (see Fig. 1 
legend). Black letters, major elements; grey letters, minor and trace elements. 
The platinum group elements, plus Al, Ti, Zr and W, and most rare-earth 
elements and actinides, which condense first above 1,600 K, are 
preferentially found in refractory inclusions. The group of mildly refractory 


but seems to have been lower than that of the modern mantle. As it is 
particularly difficult to ascertain whether the measured water con- 
tents of lavas reflect their contents at the time of eruption, the Ce 
contents of melt inclusions in olivine from the 2.7-Gyr-old koma- 
tiites from Belingwe, Zimbabwe’, and Alexo, Canada‘, can be used as 
a proxy; they show that if the H2O/Ce ratio has not significantly 
evolved since that time, the mantle at 2.7 Gyr ago contained 
~150 p.p.m. H2O for 50% melting and ~60 p.p.m. for 20% melting. 
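
The dependence of the inferred water content on the degree of melting can be reproduced with a short calculation. The sketch assumes that Ce and water are perfectly incompatible, so that the source concentration is the melt concentration multiplied by the melt fraction. The Ce content of the melt (1.5 p.p.m.) is not given in the text; it is an assumed value back-calculated so that the quoted figures come out, and the function name is hypothetical.

```python
def source_water_ppm(ce_melt_ppm, melt_fraction, h2o_ce_ratio=200.0):
    """Water content of the mantle source (p.p.m.) from the Ce content of a melt.

    Assumes a perfectly incompatible element: everything enters the melt,
    so C_source = melt_fraction * C_melt.
    """
    ce_source_ppm = melt_fraction * ce_melt_ppm
    return h2o_ce_ratio * ce_source_ppm


CE_MELT_PPM = 1.5  # assumed value, chosen to match the figures quoted in the text

water_50 = source_water_ppm(CE_MELT_PPM, 0.50)  # 50% melting
water_20 = source_water_ppm(CE_MELT_PPM, 0.20)  # 20% melting
```

Under these assumptions the estimate scales linearly with the melt fraction, which is why the two quoted values differ by the ratio 50:20.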

It has been suggested that much of the mantle’s water may accumulate 
in the transition zone in wadsleyite’* and, therefore, that the 
water content of the mantle may not be homogeneous. This is true 
only if water is held back in the transition zone on subduction and not 
dragged down with its carrier minerals. Fractionation across the 
phase-change boundary requires that when wet material is trans- 
ported in and out of the transition zone, the advective flux is com- 
pensated for by diffusion of hydrogen: at the temperatures prevailing 
in the sinking plates (600-1,200 °C), the diffusion coefficients of 
hydrogen in the major minerals are very small’ and the wet boundary 
layer appearing across the reaction zone very thin. Accumulation of 
water in wadsleyite is therefore of little significance to the overall 
budget of water in the mantle. 


Volatile depletion in the terrestrial planets 

Planets formed by accretion of solid material condensed from the 
solar nebula. The strong electromagnetic winds emitted by the early 
Sun swept away the nebular gas and condensation stopped well 
before the debris disk was cleared, a process that lasted less than a 
few Myr after the isolation of the Solar System'®. Most of the material 
that eventually formed the Earth accreted well within the ‘snow line’, 
which is the virtual line beyond which water freezes out!’ ”’*. A widely 
used scale of volatility is the temperature T50 at which 50% of the 
nebular inventory has been accreted to the solid phase. Let us consider 
two elements with widely different values of T50, namely, the 
refractory element uranium (T50 = 1,610 K), and the more volatile 
element potassium (T50 = 1,006 K)*. The K/U ratio is a measure of 
the relative depletion of planetary volatile elements with respect to 
lithophile elements, which comprise the major elements Si, Mg, Fe and Ca, 
accrete down to 1,300 K as metal, olivine and pyroxenes, and make up the 
bulk of the planetary mantle and core. These refractory elements are 
separated from the high-T chalcophile elements (As, Ga, Ge, Cu, Ag), 
chlorine, and the alkali elements (Li, Na, K, Rb and Cs), which condense 
between 1,150 and 850 K. From 750 to 530 K, the low-T chalcophile elements 
(Pb, Bi, Sn, Zn, Cd, S, Te) precipitate, followed by the truly volatile elements 
(N, C, H) and Hg. Each group appears to be separated from its neighbours by 
a temperature gap (highlighted in white). 


refractory elements. Carbonaceous chondrites and the solar pho- 
tosphere provide a coherent estimate of the initial K/U ratio of 
60,000 (ref. 14). The terrestrial value of ~10,000 (refs 15, 16) indi- 
cates that the Earth lacks 85% of the nebular K inventory, while the 
deficit for the Moon (K/U = 3,000) is 95%. Martian meteorites that 
have been witnessed falling and therefore evaded long periods of 
terrestrial weathering show K/U ratios <20,000 (ref. 17). Depletion 
of 92-98% is also inferred for Zn, Ag, As, Sb, Sn, Pb and, most 
importantly, S (refs 18, 19). Planets that lack such large fractions of 
moderately volatile elements cannot have been endowed with large 
amounts of water. 
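
The depletion figures follow directly from the K/U ratios, taking refractory uranium as the reference. The sketch below is a plain arithmetic check using the ratios quoted in the text; the function name is an assumption.

```python
def volatile_depletion(ku_planet, ku_nebular=60000.0):
    """Fraction of the nebular K inventory that is missing, relative to refractory U."""
    return 1.0 - ku_planet / ku_nebular


earth = volatile_depletion(10000.0)     # about 0.83
moon = volatile_depletion(3000.0)       # 0.95
mars_min = volatile_depletion(20000.0)  # lower bound for K/U < 20,000, about 0.67
```

The rounded terrestrial ratio of 10,000 gives roughly 83%, close to the 85% quoted; the small difference presumably reflects the approximate nature of the "~10,000" value.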

As shown by isotopic abundances of different elements, the volatile 
deficit in planetary material is not due to volatilization but to incom- 
plete accretion”. Loss of volatile elements due to impact or volati- 
lization during accretion, whether it reflects planetary escape 
velocities or the molecular stampede of hydrogen escape known as 
hydrodynamic entrainment”', is expected to be mass-dependent, and 
the light isotopes should preferentially be enriched in the vapour 
phase. The homogeneous stable isotope composition of K in planetary 
samples is inconsistent with preferential evaporation of the light 
isotopes from initially condensed (CI chondrite) material. Even more 
conspicuously, the Earth is depleted in the heavy isotopes of Zn by 
about 0.1% per atomic mass unit with respect to CI chondrites, as are 
the CV and CO chondrites, which are enriched in refractory elements 
(Fig. 1). This is exactly the opposite of what is expected from preferential 
loss of the light 64Zn during volatilization. These observations 
contrast with evidence from lunar soils, which on volatilization 
by impacts become remarkably enriched in the heavy 66Zn and 68Zn 
isotopes by 3–4‰ per atomic mass unit”. The processes leading to 
reversed mass fractionation are currently not well understood, and 
may involve kinetic effects”’ or electromagnetic sorting (Hall effect) 
of the ionized fraction of the nebular gas**. Incomplete accretion is 
also inferred from the terrestrial abundances of alkali elements, 
which have comparable volatilities and chemical properties over a 
broad range of atomic mass M. The depletions of K (M = 39, 
T50 = 1,006 K), Rb (M = 85, T50 = 800 K) and Cs (M = 133, 
T50 = 800 K)** are of comparable magnitude, which also argues 
against loss by volatilization in a gravity field. 

A robust consequence is that the Earth, like the other terrestrial 
planets, is depleted in volatile elements, not because these were lost by 
impact or vaporization, but because they did not accrete along with 
the more refractory elements in the first place. 


The stepwise accretion of planetary material 
In order to explain the volatile deficit, the idea being pursued here is 
that the nebular gas was blown off by the energetic radiation of the 
young Sun before temperature decreased enough for volatile ele- 
ments to condense on the terrestrial planets. Thermodynamic calcu- 
lations*”** successfully predict the mineralogy and composition of 
planetary objects. Most elements condense within a narrow temper- 
ature interval. As a consequence, accretion takes place as a stepwise 
process and bulk accretion goes through plateaus when shifting from 
one group of elements to another with decreasing temperature. 

The narrow temperature range of condensation of most elements is 
a common feature of thermodynamics calculations'*”””* and reflects 
relatively simple properties of solid—gas equilibria. The temperature 
range ΔT over which elements condense is approximately R(T50)²/ΔH_at, 
where R is the gas constant and ΔH_at is the atomization enthalpy”. 
Energy of mineral formation from solid elements and energy of melting 
can be safely neglected with respect to the enormous energies involved 
in vaporization. For most elements, ΔT corresponds to a few tens of 
degrees, except for the alkali elements, Eu and Yb (150-200 K). 
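
A quick numerical check of the approximate relation between the condensation interval, T50 and the atomization enthalpy is sketched below. The input values (T50 of 1,300 K and an enthalpy of 400 kJ per mole) are illustrative assumptions and do not refer to any particular element; the function name is hypothetical.

```python
R_GAS = 8.314462618  # gas constant, J mol^-1 K^-1


def condensation_interval(t50_k, atomization_enthalpy_j_per_mol):
    """Approximate temperature range of condensation: R * T50^2 / enthalpy."""
    return R_GAS * t50_k ** 2 / atomization_enthalpy_j_per_mol


# Illustrative values only: T50 = 1,300 K, atomization enthalpy = 400 kJ/mol
delta_t = condensation_interval(1300.0, 400e3)

# A smaller atomization enthalpy at the same T50 widens the interval
delta_t_weaker_bonding = condensation_interval(1300.0, 200e3)
```

For these inputs the interval is about 35 K, in line with "a few tens of degrees". Because the enthalpy sits in the denominator, weakly bound elements condense over a wider temperature range.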

Intrinsic volatility is not the sole control of planetary abundances. 
Figure 2 is a plot of atomization enthalpy versus T50. The highly 
refractory elements—that is, the platinum group elements, Al, Ti, 
Zr, W, and most rare-earth elements and actinides, which are all 
particularly abundant in chondrite refractory inclusions—condense 
above 1,600 K. They are followed by the dominant group of refractory 
lithophile elements (Si, P, and the alkali-earth and transition elements 
typical of chondrules), which condense down to 1,300K. These 
refractory elements are separated from a group comprising the 
high-T chalcophile elements (As, Ga, Ge, Cu and Ag), chlorine, and 
the alkali elements (Li, Na, K, Rb and Cs), which accrete between 1,150 
and 850K. At lower temperatures (750 to 530K), the group of low-T 
Figure 3 | Stepwise accretion of the elements upon cooling of the solar 
nebula, shown as fraction condensed versus temperature. Plotted is the 
fraction of the original condensable matter remaining in the solar nebula at a 
given temperature (redrawn with changes from ref. 30). The kinks in the 
accretion rate correspond to the temperature gaps in Fig. 2. After formation of 
the most abundant refractory phases (plagioclase and clinopyroxene), the 
bulk of the metallic core and the mantle silicates (olivine and orthopyroxene) 
is quickly removed from the nebular gas (purple). Alkali elements and high-T 
chalcophile elements precipitate next (green), but do not constitute a large 
fraction of the planetary material. Low-T chalcophile elements, sulphur and 
halogens come after (red), followed by volatile elements. Most elements 
accrete over a few tens of degrees, but over 150-250 K for the alkali elements, 
and Eu and Yb. A planet with a substantial deficit of K or Zn with respect to 
chondrites therefore cannot have accreted much, if any, water. 
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chalcophile elements (Pb, Bi, Sn, Zn, Cd, S and Te) and the rest of the 
halogens leave the vapour phase. A last group consists of the three 
most volatile elements (N, C and H) and Hg. Each group appears to be 
separated from its neighbouring groups by a temperature gap. The 
stepwise accretion of the planetary material is well illustrated in refs 30 
and 31, which show that, at 1,200 K, 50-60% of the nebular inventory 
is still in the gas phase (Fig. 3). This reflects the combination of 
the elements into stoichiometric solid phases such as oxides, olivine, 
pyroxenes, feldspars and metal”, and, to a lesser extent, the intrinsic 
volatility of the elements themselves. 

The energetic electromagnetic radiation of the young Sun swept 
away the nebular gas and stopped accretion while the nebular tem- 
perature in the neighbourhood of the terrestrial planets was still in the 
temperature range of alkali element condensation (800-1,000 K) 
(Fig. 4). This is the main cause of volatile element depletion in the 
inner Solar System. In contrast, in material accreted at the snow line 
and beyond, water and other volatile elements were plentiful. The ‘late 
veneer’ hypothesis, which holds that small amounts of chondritic 
material from the asteroidal belt or beyond were added to the Earth at 
an unspecified late stage, was devised to account for the present-day 
excess of highly siderophile elements, such as the platinum group 
elements, in the terrestrial mantle. The depletion of the most volatile 
elements discussed above places an upper limit on the proportion of 
low-temperature CI-like material of the order of 2-5%, which is 
still somewhat higher than the 0.3% proportion of carbonaceous 
chondrite-type (with ~10% water) material that is deemed necessary 
to account for terrestrial water. 
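
The mass balance behind these percentages can be checked with a short Python sketch. It multiplies the veneer fraction by the water content quoted in the text and compares the result with the present-day ocean. The Earth and ocean masses are standard approximate values supplied here as assumptions; they are not quoted in the text.

```python
# Sketch: water delivered by a chondritic late veneer.
EARTH_MASS_KG = 5.97e24   # mass of the Earth (assumed standard value)
OCEAN_MASS_KG = 1.4e21    # approximate mass of the present-day ocean (assumed)


def veneer_water_kg(veneer_fraction: float, water_fraction: float) -> float:
    """Mass of water carried by a veneer making up `veneer_fraction` of the Earth."""
    return veneer_fraction * water_fraction * EARTH_MASS_KG


needed = veneer_water_kg(0.003, 0.10)      # 0.3% veneer with ~10% water
upper_low = veneer_water_kg(0.02, 0.10)    # 2% upper limit
upper_high = veneer_water_kg(0.05, 0.10)   # 5% upper limit

for label, mass in [("0.3%", needed), ("2%", upper_low), ("5%", upper_high)]:
    print(f"{label} veneer: {mass / OCEAN_MASS_KG:.1f} ocean masses")
```

Under these assumptions a 0.3% veneer carries slightly more than one ocean mass of water, whereas the 2-5% upper limit would allow several to a few tens of ocean masses.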


Making planets and adding water 

One model of planetary growth and development may be summarized 
into three stages: (1) the settling of dust onto the equatorial disk of the 
planetary nebula and formation of kilometre-sized planetesimals, (2) 
the runaway growth of planetesimals into Mars-sized planetary 
embryos, and (3) collision of planetary embryos to form the planets 
with more or less their modern masses. How fast the nebular gas was 
blown off by energetic solar radiation is unclear: the delay is constrained 
by the short lifetime (<3 Myr) of the debris disk and by dynamical 
models, which require a reduction of planetary eccentricity by dynamic 
friction between planets, asteroids and planetary embryos. It is likely, 
however, that the planets had accreted to a substantial fraction of 
their final mass before the disk was blown off. 

Figure 4 | The thermal structure of the planetary solar nebula: 
temperature at mid-plane of the nebular disk. Data are taken from ref. 88. 
Two models of temperature distribution are shown for two different values 
of the viscous dissipation parameter, α (viscosity × sound velocity at 
mid-plane × mean elevation above mid-plane) for a rate of solar accretion of 1% 
per Myr: α = 10 ° (solid line) and α = 10! (dashed line). The position of 
the wiggly ‘snow line’, which separates two domains, ice-free inwards and 
frosty outwards, nearly coincides with that of the asteroid belt. Colour 
coded areas correspond to the four groups of elements identified by their T₅₀ 
values in Fig. 2. The depletion of volatiles in the inner Solar System is caused 
by the strong electromagnetic winds emitted by the young Sun, which swept 
away the nebular gas before accretion was complete. 

Water delivery by comets with a D/H ratio of 3 × 10⁻⁴ (refs 38, 
39) seems incompatible with the D/H ratio of the terrestrial 
ocean (1.5 × 10⁻⁴). Carbonaceous chondrites represent an alternative 
source of terrestrial water. Orbits of planetary embryos across 
the Solar System during accretion have been computed, and the 
authors concluded that the bulk of the water at present on Earth was 
carried by only a few of these objects, originally formed in the outer 
asteroid belt and accreted to the Earth during the waning stages of 
its formation. As long as the nebular gas was present and was absorb- 
ing electromagnetic radiation, the inner Solar System was too hot to 
allow for significant water condensation along with metal and sili- 
cates: only during the late stages of accretion had temperature declined 
enough for water delivery to be efficient. Dynamical models” suggest 
that early accretion of asteroid and planetary embryos was local, and 
therefore that the accreted material was largely dry up to the snow line 
and ice-rich beyond. It is also found that radial mixing and inward 
migration of wet high-eccentricity objects from the outer Solar System 
increased dramatically in the later phases of accretion. Water absorp- 
tion in the hot inner Solar System has been proposed* but, as two- 
dimensional liquid films cannot form below the critical temperature 
of water, this process is unlikely to be significant. 

Figure 5 | A tentative chronology of the Earth's accretion. Chronometers 
shown in brown. Accretion of planetary material was interrupted by 
energetic electromagnetic radiation (T Tauri phase) sweeping across the disk 
within a few Myr of the isolation of the solar nebula. Runaway growth of 
planetesimals produces Mars-sized planetary embryos, which, collision after 
collision, form the planets with their modern masses. The last of these ‘giant’ 
collisions left material orbiting the Earth that later reassembled to form the 
Moon. The ¹⁸²Hf-¹⁸²W chronometer dates metal-silicate separation. The 
identical abundance of radiogenic ¹⁸²W between the Earth and the Moon 
indicates that either the Moon formed after all the short-lived ¹⁸²Hf had 
disappeared (>60 Myr) or, rather, the Moon-forming impact and terrestrial 
core segregation took place simultaneously 30 Myr after isolation of the solar 
nebula. Addition of a late veneer of chondritic material coming from beyond 
2.5 AU provides a strong explanation for the modern abundances of 
siderophile and volatile elements in the terrestrial mantle. This material also 
contained water and other volatile elements, which account for the origin of 
the terrestrial ocean. Such a model indicates that most of the terrestrial Pb 
and Xe was delivered by the asteroids that constituted the late veneer, and 
therefore that the young Pb-Pb and I-Xe ages of the Earth date, not the 
Earth, but events that affected the asteroids. It is suggested here that these 
events are those of the accretion to the Earth of the late veneer. 

The mechanism of water accretion depends on the relative timing 
of three important events: the segregation of the terrestrial core, the 
giant lunar impact, and the arrival of the late veneer (Fig. 5). The 
identical ¹⁸²W excesses found in the silicate fractions of the Earth and 
the Moon require that the giant impact took place at the earliest 60 
Myr after the formation of calcium-aluminium-rich inclusions 
unless the Hf/W ratios of the two planetary bodies also are similar, 
in which case this time can be reduced to ~30 Myr (refs 46, 47). A 
lunar impact with a wet Earth, that is, after substantial amounts of 
late veneer had been accreted, is difficult to reconcile with the 
remarkable depletion of volatiles in the lunar mantle**** with respect 
to the terrestrial mantle. The contrasting abundances of the stable 
(unradiogenic) Ne isotopes 20 and 22 (ref. 49) and of the stable Xe 
isotopes 124, 128 and 130 (ref. 50) between the terrestrial mantle, 
represented by basalts and CO₂ wells, and the terrestrial atmosphere 
also require that a substantial fraction of the volatiles was accreted 
after the last large-scale melting of the Earth. The invariant proportions 
of the stable Ar isotopes 36 and 38 in all terrestrial solids and 
gases indicate that isotope fractionation of Ne and Xe was not 
caused by hydrodynamic escape. Although the case has been made 
recently that the lunar mantle may locally hold traces of water?’, the 
depletion of the Moon in volatile elements, regardless of their atomic 
weights and therefore of gravitational loss, remains unexplained. 
However, the occasional closeness of planetary bodies with very dif- 
ferent degrees of hydration and, because of lateral transfer, the highly 
variable water contents of impactors are strong features of dynamic 
simulations”. 
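
The two Hf-W time constraints quoted at the start of this paragraph (about 30 and 60 Myr) can be put in perspective by computing how much ¹⁸²Hf survives after each interval. The Python sketch below assumes simple exponential decay with a half-life of 8.9 Myr for ¹⁸²Hf; that half-life is a standard value supplied here and is not quoted in the text.

```python
# Sketch: fraction of 182Hf surviving after a given time.
HALF_LIFE_182HF_MYR = 8.9  # half-life of 182Hf in Myr (assumed standard value)


def fraction_remaining(elapsed_myr: float,
                       half_life_myr: float = HALF_LIFE_182HF_MYR) -> float:
    """Fraction of the initial 182Hf still present after `elapsed_myr`."""
    return 0.5 ** (elapsed_myr / half_life_myr)


at_30 = fraction_remaining(30.0)
at_60 = fraction_remaining(60.0)

print(f"after 30 Myr: {at_30:.1%} of the initial 182Hf remains")
print(f"after 60 Myr: {at_60:.1%} of the initial 182Hf remains")
```

After 60 Myr less than 1% of the parent nuclide is left, so the chronometer is effectively extinct, whereas at 30 Myr roughly a tenth remains and differences in Hf/W would still leave a measurable ¹⁸²W signal.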

Because Pb is volatile, with a T₅₀ value (~725 K; ref. 13) similar to 
that of Zn, the arrival of the late veneer can be dated by Pb-Pb 
chronology. Old feldspars and galenas indicate Pb-Pb ages of the 
Earth younger by 50-160 Myr relative to the age of the Solar 
System (Fig. 6). The delayed ingrowth of radiogenic Pb therefore 
is best explained by impacts of asteroids with very low, chondrite-like 
μ (²³⁸U/²⁰⁴Pb) ratios onto a volatile-depleted, high-μ proto-Earth. 
The μ value of the lunar mantle attested to by Pb isotope systematics 
of Apollo samples is 200-600 times the ratio of the modern terrestrial 
mantle and 1,000-10,000 times that of CI chondrites. Simple mass 
balance using the μ values suggests that well over 99% of terrestrial Pb 
was added by the late veneer. The young Pb-Pb age of the Earth 
therefore appears to be the mean age of the arrival of the late veneer 
material and does not represent a date in the history of primordial 
terrestrial Pb. Some U/Pb fractionation due to volatilization or to 
incomplete mergers (‘hit-and-run’ collisions) during the encounter 
of these asteroids with the Earth may account for the young Pb-Pb 
ages. Likewise, the apparent I-Xe and Pu-Xe ‘ages of the Earth’, 
which are bracketed between 50 and 110 Myr (refs 58-60), date the 
arrival on Earth of I and Xe, two highly volatile elements, and are 
reasonably consistent with the Pb-Pb age. Hence, arrival of the late 
veneer about 100 ± 50 Myr after the collapse of the nebula is supported 
by independent sets of geochemical observations. 

[Figure 6 plot: ²⁰⁶Pb/²⁰⁴Pb (vertical axis) versus $\exp(\lambda_{238\mathrm{U}} T) - 1$ and age T in Gyr (horizontal axes); annotated age 4.40 ± 0.04 Gyr.] 

Figure 6 | Galena data and the young age of the Earth. Data are taken from 
ref. 54. The evolution of ²⁰⁶Pb/²⁰⁴Pb in conformable Pb deposits is linear in 
$\exp(\lambda_{238\mathrm{U}} T) - 1$, where $\lambda_{238\mathrm{U}}$ is the decay constant of ²³⁸U, which 
indicates that the mantle source of galenas behaved as a closed system since 
the early geological ages. The primordial Pb of the Solar System, represented 
by the 4.56-Gyr-old Canyon Diablo troilite, is not on the galena trend, 
which suggests that terrestrial Pb is younger than that of this meteorite. This 
condition reflects that either Pb segregated into the core at a late stage or, as 
suggested here, that terrestrial Pb was largely inherited from the late 
accretion of volatile-rich material (late veneer) to the Earth. 

The nature of the wet, late-accreting icy bodies from beyond the 
snow line, whether identified meteoritic material or so-far unsampled material, is 
not understood. It is likely that any source of planetary material that 
significantly contributed to planetary accretion during the first 50 Myr 
of the Solar System is now exhausted. This makes the quest for a 
particular type of modern chondritic contributor intrinsically elusive. 


Hydrating the mantle and setting off planetary dynamics 


If water is a late addition to terrestrial accretion, the question arises of 
when and how modern mantle water was introduced into the solid 
Earth*. Solution in the primordial magma ocean (ingassing)*! 
requires a very high atmospheric temperature (>800 °C) to prevent 
mantle hydration. Foundering the hydrous crust of serpentine and 
other hydrous minerals” of the magma ocean is an alternative mech- 
anism for early deep Earth hydration. In both cases, however, crys- 
tallization of the magma ocean is expected to concentrate water, a 
very incompatible molecule, in the residual melt and therefore acts to 
remove most of the water initially present in the upper mantle. 

Water and all the other incompatible elements are particularly 
depleted in the upper mantle for two main reasons. First, orogenic 
volcanism at converging plate boundaries removes most of the volatile 
and incompatible lithophile elements from the hydrous lithospheric 
plates. Later, mid-ocean-ridge magmas remove from the upper mantle 
most of what survived the subduction ‘filter’. In addition, it is a general 
feature that magma extraction under mid-ocean ridges and ocean 
islands is a rather shallow process (<100 km) that takes place during 
decompression of the mantle: the deeper into the mantle water and 
incompatible elements are buried, the less likely they are to become 
involved in melting and extraction, which is apparent in convection 
models dealing with geochemical tracers®’. Water is continuously lost 
to the deep mantle by subduction of a post-serpentine hydrous phase 
known as phase D**. The presence of hydrous phases at the top of the 
lower mantle may actually account for stronger seismic attenuation at 
subduction zones than in the ambient mantle’’. 

Water decreases the viscosity of major mantle minerals, notably 
olivine®’. It reduces the strain before lithospheric failure and there- 
fore affects mantle dynamics®. A strong reduction of mantle viscosity 
contingent on subtle olivine hydration is suspected to shift the elastic 
thickness of the lithosphere and to change mantle dynamics from a 
stagnant-lid regime (strong dry mantle) to a plate tectonic regime 
(soft wet mantle)”°. 

The varying fates of water on the different terrestrial planets provide 
a relatively simple explanation of their contrasting evolution. 
Assuming that water and Ce have similar residence times of 
6-8 Gyr (ref. 71) in the terrestrial mantle, ~50% of the ocean must 
have been subducted into the mantle after 4.5 Gyr, which is precisely 
what the widely held view of ‘one ocean inside for one ocean outside’ 
predicts. 
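
One way to see where a figure of about 50% could come from is a single-reservoir, first-order exchange model. That model is an assumption made here for illustration, since the text does not state which model underlies the estimate. In the Python sketch below the transferred fraction after time t is taken as 1 - exp(-t/τ), with τ the residence time.

```python
# Sketch: fraction transferred under an assumed first-order exchange law.
import math


def fraction_transferred(elapsed_gyr: float, residence_time_gyr: float) -> float:
    """Fraction of a reservoir exchanged after `elapsed_gyr` for time constant tau."""
    return 1.0 - math.exp(-elapsed_gyr / residence_time_gyr)


low = fraction_transferred(4.5, 8.0)   # longest residence time quoted in the text
high = fraction_transferred(4.5, 6.0)  # shortest residence time quoted in the text

print(f"transferred after 4.5 Gyr: {low:.0%} to {high:.0%}")
```

For residence times of 6-8 Gyr this simple law gives roughly 43-53% after 4.5 Gyr, which brackets the ~50% quoted above.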

The status of water on Venus can be explained by conditions 
opposite to those on the Earth: hot surface vapour quickly reacts 
with the crust and is rapidly transported downwards because the 
more hydrous the mantle, the faster its convection. The loss of water 
from the surface is largely controlled by its reactivity and therefore by 
surface temperature. On both planets, the depletion of the upper 
mantle acts to hold water back in the lower mantle. The unique 
hypsometric maximum on Venus contrasts, however, with the bimodal 
distribution of elevations on Earth and demonstrates that Venus lacks 
true continents. Evidence of plate tectonics, if it existed in the past, 
was erased by resurfacing ~600-900 Myr ago. Although some 
water is currently lost from the atmosphere of Venus, the relative 
abundances of atmospheric rare gases argue against hydrodynamic 
escape in the past. If Venus and the Earth initially had oceans of 
similar sizes, an alternative may be found to atmospheric loss on 
Venus: water may simply have been dragged into the mantle by early 
plate tectonics. Eventually, this may become the fate of the terrestrial 
oceans and continents as well, once enough water has been intro- 
duced into the mantle, making it softer than it is today. 

It has been argued that Martian water was delivered by asteroids 
and comets from beyond 2.5 AU. It was also suggested that Mars’ 
interior is dry because, although the planet acquired a late volatile-rich 
veneer, the veneer was not folded into the mantle. A dry mantle 
explains the early demise of plate tectonics on Mars. In contrast to 
the Earth and Venus, Mars lost its atmosphere to space by hydro- 
dynamic escape during the first few hundred Myr after accretion”. 
Fractionation of the atmospheric ³⁶Ar/³⁸Ar ratio attests to hydrodynamic 
escape. The weak gravity field due to the small planet radius 
has a triple effect: (1) escape from the atmosphere is much easier on 
Mars than on Earth; (2) because the pressure gradient is about three 
times weaker on Mars, the phase changes giving rise to the viscosity 
jump of the terrestrial transition zone are shifted to the bottom of the 
Martian mantle, thereby preventing significant accumulation of 
water at depth; and (3) the bending of plates and, hence, the onset 
of plate tectonics is made more difficult. It has been suggested that the 
mantle source of Martian shergottites could be wet***', but this is 
only possible if these rocks are younger than a few hundred Myr, a 
point that was recently challenged on the basis of Pb—Pb ages in excess 
of 4 Gyr (ref. 82). 
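
The effect of the weaker pressure gradient can be illustrated with a hydrostatic estimate of how deep a given pressure is reached on each planet. The Python sketch below rests on several simplifying assumptions that are not from the text: a uniform mantle density of 3,500 kg m⁻³, gravity held constant at its surface value, and about 23 GPa as the pressure at the base of the terrestrial transition zone.

```python
# Sketch: hydrostatic depth of a given pressure level on Earth and Mars.
G_EARTH = 9.81           # surface gravity, m s^-2
G_MARS = 3.71            # surface gravity, m s^-2
MANTLE_DENSITY = 3500.0  # assumed uniform mantle density, kg m^-3 (illustrative)


def depth_km(pressure_gpa: float, gravity: float,
             density: float = MANTLE_DENSITY) -> float:
    """Depth (km) at which the pressure density * gravity * depth is reached."""
    return pressure_gpa * 1e9 / (density * gravity) / 1e3


gradient_ratio = G_EARTH / G_MARS
earth_depth = depth_km(23.0, G_EARTH)  # 23 GPa is an assumed, approximate value
mars_depth = depth_km(23.0, G_MARS)

print(f"pressure-gradient ratio Earth/Mars: {gradient_ratio:.1f}")
print(f"depth of 23 GPa on Earth: {earth_depth:.0f} km, on Mars: {mars_depth:.0f} km")
```

With these inputs the same pressure is reached at a depth between two and three times greater on Mars than on Earth, in line with the statement that the relevant phase changes are shifted towards the bottom of the Martian mantle.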

Contrasting dynamic regimes, and, therefore, the amount of water 
present in planetary mantles, have diverse effects on the dynamo: a 
plate tectonic regime allows heat to be evacuated more efficiently 
from the core than a stagnant-lid regime*’. The Earth, and possibly 
early Venus, illustrates the former case with active plate tectonics. In 
the latter case, the dynamo chokes, the magnetosphere vanishes, and 
the atmosphere gets eroded by solar wind: this was probably the fate 
of Mars, initially triggered by a water-poor mantle and the weak 
gravity field of the planet, and may become the fate of Venus. 


Future directions 


Beyond the fascinating question of how the Earth formed and 
acquired its hydrosphere lies the question of the origin of life. 
Associating life on Earth with plate tectonics and the presence of 
continents, all of them being contingent on the presence of water, is 
tantalizing and justifies the ‘follow the water’ motto of space agencies. 
Although the reducing environments created on reduction of water by 
the magma ocean may have jump-started biological activity, it is not 
clear that a ‘water world’ can provide a steady source of nutrients and 
sustain life*’. Both the timing of water delivery (relevant to the onset of 
plate tectonics) and the amount of water (for the persistence of a 
shallow ocean with subaerial reliefs) need to be understood. The 
detailed chemical reactions at work during the giant lunar impact*® 
and even its timing remain poorly understood. New dynamical 
models of accretion will need to deal with radial transfer across the 
nebula and with hit-and-run impacts that severely affect planetary 
chemistry. The past decade witnessed major conceptual changes in 
our understanding of the early history of the terrestrial planets, and 
more surprises seem to be ahead of us. 


What recent ribosome structures have 
revealed about the mechanism of translation 


T. Martin Schmeing & V. Ramakrishnan 


The high-resolution structures of ribosomal subunits published in 2000 have revolutionized the field of protein translation. 
They facilitated the determination and interpretation of functional complexes of the ribosome by crystallography and 
electron microscopy. Knowledge of the precise positions of residues in the ribosome in various states has facilitated 
increasingly sophisticated biochemical and genetic experiments, as well as the use of new methods such as single-molecule 
kinetics. In this review, we discuss how the interaction between structural and functional studies over the last decade has led 
to a deeper understanding of the complex mechanisms underlying translation. 


The ribosome is the large ribonucleoprotein particle that 
synthesizes proteins in all cells, using messenger RNA as 
the template and aminoacyl-transfer RNAs as substrates. 
Ribosomes from bacteria consist of a large (50S) and a small 
(30S) subunit, which together compose the 2.5-megadalton 70S ribosome; 
their eukaryotic counterparts are the 60S and 40S subunits and 
the 80S ribosome. The 50S subunit consists of 23S RNA (~2,900 
nucleotides), 5S RNA (~120 nucleotides) and about 30 proteins; 
the 30S subunit consists of 16S RNA (~1,500 nucleotides) and about 
20 proteins. In addition, several protein factors act on the ribosome at 
various stages of translation. In this review, we focus mainly on 
structural and mechanistic insights into bacterial translation 
obtained in the last few years. A previous review deals more extensively 
with earlier work (ref. 1). 

The essentially complete atomic structures of an archaeal 50S subunit 
from Haloarcula marismortui (ref. 2) and a bacterial 30S subunit from 
Thermus thermophilus (ref. 3) published in 2000 were the basis for the phasing 
and/or molecular interpretation of every subsequent structure of the 
ribosome or its subunits. Such structures include low-resolution structures 
of the 70S ribosome by crystallography (ref. 4) or cryoelectron microscopy 
(cryoEM) (ref. 5), the structure of a bacterial 50S subunit (ref. 6), and more 
recent high-resolution structures of the 70S ribosome (refs 7, 8). Finally, mobile 
elements of the 50S subunit such as the L1 or L7/L12 stalks that are 
partly or completely disordered in most high-resolution structures of 
the ribosome or the 50S subunit have been solved in isolation (refs 9, 10). 

The basic architecture of the ribosome is shown in Fig. 1. The 
interface between the two subunits consists mainly of RNA. The 
mRNA binds in a cleft between the ‘head’ and ‘body’ of the 30S 
subunit, where its codons interact with the anticodons of tRNA. 
There are three binding sites for tRNA: the A site that binds the 
incoming aminoacyl-tRNA, the P site that holds the peptidyl-tRNA 
attached to the nascent polypeptide chain, and the E (exit) site to 
which the deacylated P-site tRNA moves after peptide-bond forma- 
tion before its ejection from the ribosome. In the 50S subunit, the 3’ 
ends of P- and A-site tRNAs are in close proximity in the peptidyl- 
transferase centre (PTC), whereas the 3’ end of the E-site tRNA is 
~50 Å away from the PTC. 


Initiation 
Bacterial translation can be roughly divided into three main stages: 
initiation, elongation and termination (Fig. 2; a movie of the process 
can be seen at http://www.mrc-lmb.cam.ac.uk/ribo/homepage/movies/translation_bacterial.mov). 
Initiation requires the ribosome 
to position the initiator fMet-tRNAfMet over the start codon of 
mRNA in the P site. In bacteria, the ribosome is positioned in the 
vicinity of the start codon by base pairing between the 3’ end of 16S 
RNA and an approximately complementary sequence just upstream 
of the mRNA start codon, called the Shine-Dalgarno sequence. The 
precise positioning of the start codon in the P site requires the binding 
of a special initiator fMet-tRNAfMet and three initiation factors, 
IF1-3. However, exactly how the correct tRNA is selected remains 
unclear, as do the roles of the various factors. 

A probable first step in initiation is the binding of IF3 to the 30S subunit that 
has been split from the 50S by ribosome recycling factor RRF and 
elongation factor G (EF-G) after translational termination (see Fig. 2 
and the termination section later). This binding stimulates release of 
the mRNA and deacylated tRNA, left over from the previous round of 
translation, from the 30S and prevents the large subunit from 
re-associating (refs 11-13). The binding of the 30S-IF3 complex to mRNA, IF1, 
IF2 and initiator tRNA results in the 30S initiation complex (30S-IC). 
IF2, a GTPase, promotes subunit joining to form the 70S initiation 
complex (70S-IC), which is accompanied by IF3 release (refs 14, 15). After 
GTP hydrolysis and phosphate release from IF2 (refs 16, 17), 
fMet-tRNAfMet moves into the PTC, readying the ribosome for elongation. 

The mechanism of initiation is still unclear, owing to a paucity of 
structural data. There has been little progress towards high-resolution 
structures of initiation complexes since the structure of IF1 bound to a 
30S subunit (ref. 18). However, recent cryoEM studies have visualized both 
30S and 70S initiation complexes. In a 30S-IC (ref. 19), which 
unfortunately did not contain IF3, IF2 stretches across the subunit interface 
of the 30S, contacting the acceptor end of fMet-tRNAfMet with its 
carboxy terminus. The anticodon stem and elbow are shifted towards 
the E site, resulting in a ‘30S P/I state’. IF1 is visible in the A site, but does 
not contact IF2. After subunit joining, the G domain of IF2 interacts 
with the GTPase centre of the large subunit (ref. 20). It maintains its contacts 
with fMet-tRNAfMet, which has shifted up out of plane from the 30S P/I 
state to a 70S P/I state, and seems to make a direct contact with IF1 in the 
70S-IC. The 30S subunit is rotated relative to the 50S by ~4° anticlockwise, 
similar to the ratcheting seen during translocation (ref. 21). 

In the structure of 70S-mRNA-fMet-tRNAfMet-IF2-GDPCP (ref. 22), 
IF2 is still bound to the GTPase centre, but has lost contact with 
fMet-tRNAfMet, now in the PTC in the canonical P/P state. The 
authors have suggested that this conformation represents the state 
after GTP hydrolysis before Pi release. Alternatively, another group 
has suggested that it is the result of the absence of IF1 and IF3 (ref. 23). 
The 70S complex with the GDP state of IF2 has the 30S subunit 
returned to the un-ratcheted state and IF2 largely separated from 
the GTPase centre, ready to dissociate from a properly initiated 
70S ribosome. Single-molecule fluorescence resonance energy transfer 
(FRET) studies show that this subunit rotation, which readies the 
ribosome for elongation, requires GTP hydrolysis, thus supporting a 
direct role for the GTPase activity of IF2 in initiation, which has been 
in dispute. 

Figure 1 | Structure of the ribosome. a, ‘Top’ view of the 70S ribosome with 
mRNA and A-, P- and E-site tRNAs. b, c, Exploded view of the 30S subunit 
(b) and 50S subunit (c). The structure of the L7/L12 arm (ref. 10) was fit onto the 
70S ribosome, with mRNA elongated by modelling. This and all other 
figures were made with Pymol (Delano Scientific) and Photoshop (Adobe). 


The elongation cycle 

The elongation cycle consists of the steps involved in sequentially 
adding amino acids to the polypeptide chain (Fig. 2). At the beginning 
of the cycle, the ribosome contains a peptidyl-tRNA with a 
nascent polypeptide chain in the P site and an empty A site. During 
decoding, the next amino acid is delivered in a ternary complex of 
elongation factor Tu (EF-Tu), GTP and aminoacyl-tRNA. Decoding 
is followed by peptide-bond formation, resulting in the elongation of 
the polypeptide chain by one amino acid. EF-G-catalysed translocation 
moves the tRNAs and mRNA with respect to the ribosome. 
Decoding. Decoding ensures that the correct aminoacyl-tRNA, as 
dictated by the mRNA codon, is selected in the A site. The binding 
of the appropriate ternary complex in the A site of the ribosome results 
in GTP hydrolysis by EF-Tu, the dissociation of the factor from the 
ribosome and the movement of the aminoacyl end of A-site tRNA into 
the PTC, termed accommodation (Fig. 3). The many steps of decoding 
have been dissected by pre-steady state kinetic measurements 
and single-molecule FRET studies. 

The high accuracy of tRNA selection cannot be accounted for by 
just the free energy differences between base pairing and mismatches 
of the codon and anticodon”*”’, even considering the contribution of 
proofreading. Instead, interactions made by three universally conserved 
bases of the ribosome with the minor groove of the first two 
base pairs of the codon-anticodon helix give rise to further discrimination 
(Fig. 3). Such close monitoring of base-pairing geometry by 
the ribosome does not occur at the wobble position, consistent with 
the degeneracy of the genetic code. The binding energy of these extra 
interactions is not used primarily to increase the relative affinity of 
cognate versus near-cognate tRNA, but instead to induce a domain 
closure in the 30S subunit*', which presumably leads to the accelera- 
tion observed in rates of the forward steps in decoding”. 

CryoEM studies of EF-Tu at increased resolution**** show that EF- 
Tu contacts the shoulder domain of the 30S subunit. Thus, domain 
closure would move the shoulder domain of the 30S subunit towards 
the ternary complex”’, potentially stabilizing the transition state for 
GTP hydrolysis by EF-Tu” and leading to an acceleration of GTPase 
activation and tRNA selection. It seems that mutations or antibiotics 
that facilitate domain closure decrease the accuracy of the ribosome, 
whereas mutations that make domain closure more difficult result in 
increased accuracy’. 

Figure 2 | Overview of bacterial translation. For simplicity, not all intermediate steps are shown. The colour scheme shown here is used consistently 
throughout this review. aa-tRNA, aminoacyl-tRNA; EF, elongation factor; IF, initiation factor; RF, release factor. 


These cryoEM structures, the most recent of which are beyond 7 Å 
resolution, also show that the tRNA is bent at the anticodon stem 
(Fig. 3f). The anticodon stem in the decoding centre is very nearly in 
the orientation acquired after accommodation and movement of the 
acceptor arm into the PTC. Thus, the binding energy derived from 
base pairing between the correct codon—anticodon is not only used to 
induce a conformational change in the ribosome, but also to distort 
the tRNA. A distorted tRNA may be characteristic of the transition 
state for GTP hydrolysis by EF-Tu, consistent with experiments 
showing that a fragmented tRNA is unable to carry out decoding. 
In addition, recent mutational data on S12, a protein at the shoulder 
of the 30S subunit with a tail that stretches into the decoding centre, 
suggest it may be involved in relaying changes induced at the decoding 
centre to the ternary complex. 

Figure 3 | Decoding by the ribosome. a, In the apo ribosome, A1492 and 
A1493 are stacked in h44. b, When a cognate tRNA binds to mRNA in the A 
site, A1492, A1493 and G530 change conformation to interact with the minor 
groove of the mRNA-tRNA minihelix. c-e, Interactions of the 30S with the 
codon-anticodon pair. In the first (c) and second (d) positions, ribosomal 
bases monitor the geometry of the minor groove of the base pairs. Protein S12 
also interacts with the second and third (e) positions. f, The ternary complex 
of EF-Tu and aminoacyl-tRNA with the 70S ribosome shows that the tRNA is 
bent in the anticodon stem (for example, see refs 35, 36). 

As this review was going to press, the crystal structure of EF-Tu and 
tRNA bound to the ribosome was determined”. This structure shows 
details of the tRNA distortion that allows aminoacyl-tRNA to interact 
with both EF-Tu at the factor-binding site and the decoding centre of 
the 30S subunit. Furthermore, a series of conformational changes in 
aminoacyl-tRNA and EF-Tu that occur after productive ribosome 
binding suggest a communication pathway between the decoding 
centre and the GTPase centre of EF-Tu, which would trigger GTP 
hydrolysis after codon recognition. 

After release of EF-Tu, the tRNA relaxes into the PTC. If
the anticodon stem loop is held tightly at the decoding centre
(as in the closed form induced by cognate tRNA), accommodation
is accelerated. However, recent work on the Hirsh suppressor tRNA
(a mutant Trp tRNA that recognizes the UGA stop codon) shows that
this tRNA leads to acceleration of GTP hydrolysis and apparently
accommodation with a near-cognate codon-anticodon pairing.
Thus, the mutant tRNA may be stabilized by additional interactions 
with the ribosome, rather than simply showing enhanced flexibility. 

The discrimination achieved from monitoring the minor groove 
geometry in the codon-anticodon helix by decoding centre nucleotides
through A-minor interactions can potentially yield an accuracy of
~10°-10* in a single step”. Should the ribosome use this discrimina- 
tion, then with proofreading, it would be possible to obtain much 
higher accuracy than is usually reported. Evidently, the ribosome 
forgoes accuracy by using the binding energy of codon-anticodon
recognition to induce conformational changes in the ribosome and 
tRNA that result in accelerated GTP hydrolysis and tRNA selection. 
However, a recent result suggests that the ribosome is capable of com-
bining very high accuracy (>10°) with a speed comparable to that of in
vivo protein synthesis (~22 amino acids added per second), both of
which are much higher than previous measurements in vitro (accuracy
~450, speed ~6.6 s⁻¹; ref. 22). In the recent experiments, the accommoda-
tion of tRNA into the PTC is apparently too fast to allow significant
discrimination by proofreading after GTP hydrolysis. If so, the struc-
tural basis of how one could have such a high accuracy with little or no
proofreading is not clear, nor why measured in vivo rates of misincor- 
poration are so much higher (reviewed in ref. 29). Further experiments 
with other reporters and in varying conditions are required to clarify 
these differences. 
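
The relationship between single-step discrimination, proofreading and overall accuracy can be made concrete with a little arithmetic. The sketch below assumes, as in simple kinetic-proofreading treatments, that the selectivities of an initial-selection step and an independent proofreading step multiply. The per-step values (30 and 15) are hypothetical and were chosen only so that their product matches the in vitro accuracy of ~450 quoted above; the function names are likewise illustrative.

```python
def overall_accuracy(initial_selection: float, proofreading: float) -> float:
    """Accuracy of two independent discrimination steps acting in series.

    Assumption: the selectivities multiply, as in simple
    kinetic-proofreading models.
    """
    return initial_selection * proofreading


def error_frequency(accuracy: float) -> float:
    """Approximate number of misincorporations per correct incorporation."""
    return 1.0 / accuracy


# Hypothetical per-step selectivities (not taken from the review).
initial_selection = 30.0
proofreading = 15.0

total = overall_accuracy(initial_selection, proofreading)
print(f"overall accuracy: {total:.0f}")
print(f"approximate error frequency: {error_frequency(total):.4f}")
```

Under this assumption, a high overall accuracy with little or no proofreading would require the initial-selection term alone to carry almost all of the discrimination, which is the puzzle raised in the text.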

Peptide-bond formation. The central chemical event in protein syn-
thesis is the peptidyl-transferase reaction, in which the α-amino
group of the aminoacyl-tRNA nucleophilically attacks the ester carbon
of the peptidyl-tRNA to form a new peptide bond (Fig. 4a; see the
movie at http://www.sciencedirect.com/science/MiamiMultiMediaURL/ 
B6WSR-4HHX2B2-B/B6WSR-4HHX2B2-B-2/7053/html/d074e3c1 
ecf8e4064d37dd72bc0b7e93/Movie_S1..mov). The ribosome increases 
the rate of this reaction by at least ~10°-fold**. The catalytic site is in 
domain 5 of the 23S RNA, which binds the CCA ends of aminoacyl- 
and peptidyl-tRNA (Fig. 4b). It was located precisely in crystal struc- 
tures of the H. marismortui 50S subunit*’, at the bottom of a large 
cleft (Fig. 4b). These structures precipitated many studies aimed at 
determining the catalytic mechanism of the peptidyl-transferase re- 
action. An initial proposal for a general acid/base catalytic mechanism 
involving N3 of A2451 (a nucleotide in very close proximity to
substrate analogues) was disproved by the dispensability of
A2451 for the peptidyl-transferase reaction. Furthermore, crystal
structures with improved resolution and more accurate transition state 
mimics showed that N3 of A2451 is not within hydrogen-bonding 
distance of the nucleophile throughout the reaction. When the
reactive α-amine was substituted with a hydroxyl, making chemistry
Figure 4 | Peptide-bond formation. a, Schematic drawing of the reaction.
Ade, adenine. b, Binding of tRNAs to the PTC. c, The α-amino nucleophile
is positioned by interaction with the 2’ OH of A76 of peptidyl-tRNA and N3
of A2451, as part of an extensive network of hydrogen bonds. d, e, Possible
mechanism by which the intermediate of the reaction breaks down into 
products, by a proton shuttle involving the 2’ OH of A76 of peptidyl-tRNA. 
rate limiting, a pH-independent reaction rate was observed. This is
strong evidence that there is no general acid/base catalysis involving a
group with near-neutral pKa on A2451 or any other ribosomal moiety.

If there is no acid/base catalysis, what is the source of catalytic
power of the ribosome? As with all enzymes, the precise organization
of substrates and the active site makes an important contribution.
In the ribosome, this is achieved when the binding of aminoacyl-
tRNA induces a conformational change of the PTC and peptidyl-
tRNA (http://www.nature.com/nature/journal/v438/n7067/extref/
nature04152-s6.mov, http://www.nature.com/nature/journal/v438/
n7067/extref/nature04152-s7.mov). The α-amino group of the
aminoacyl-tRNA interacts with the N3 of A2451 and the 2' OH of
A76 of the peptidyl-tRNA, as part of an extensive network of hydro-
gen bonds that position the substrates for reaction (Fig. 4c). It
had been proposed that binding and orienting of substrates accounts
for most of the ribosomal rate enhancement. A comparison of the
rate of peptide-bond formation by the ribosome and by a ribosome-
free model system suggested that the ribosome accelerated the re-
action solely by entropic effects, which may include substrate
positioning, shielding the reaction from bulk solvent, or organization
of the active site. A precisely positioned water molecule interacts
with the highly polarized transition state, as an oxyanion hole.

Although structural and biochemical studies have found no ribo- 
somal group that acts in chemical catalysis, a substrate-assisted 
mechanism is possible. The 2’ OH of the peptidyl-tRNA is well posi-
tioned to abstract a proton from the nucleophile and donate one to the
leaving group. Several studies suggest that this
hydroxyl is vital for the reaction, whereas one group proposes
it is dispensable. In the most rigorous study, Weinger et al. sub-
stituted the 2’ OH of A76 of peptidyl-tRNA with H or F (ref. 63), and
found a rate reduction of at least 10°-fold. The importance and 
proximity of the 2’ OH led to the proposal of a concerted proton 
shuttling mechanism, whereby it simultaneously accepts a proton 
from the α-amino group and donates one to the 3’ O leaving group,
perhaps as part of a six-membered ring of interactions (Fig. 4d,
e). Such a mechanism may not require perturbation of the pKa of the
2' OH.

Many mechanistic insights and biochemical experiments of the 
peptidyl-transferase reaction are based on structures of H. marismor- 
tui 50S complexes. It was questioned whether this reductionist sys- 
tem, which only includes the large subunit and the terminal 
nucleotides of tRNA, accurately represents the process in the whole 
ribosome with intact substrates”****. However, the 50S can catalyse 
the peptidyl-transferase reaction at similar rates to the 70S ribosome 
using a small dinucleotide A-site substrate, provided that a full- 
length tRNA is present in the P site®’. This analogue also shows the 
same robustness against active site mutations, and a pH profile 
similar to full aminoacyl-tRNAs in 70S ribosomes. Finally, recent
structures of a 70S ribosome with full-length tRNA substrates
show that the PTC and substrate conformations are essentially iden-
tical to those in structures of the 50S with substrate analogues.

Although the ribosome is asymmetric, a pseudo-two-fold axis of
symmetry exists at the PTC, relating the A and P sites. It is likely that
23S RNA started as a molecule of around 100 nucleotides, which
duplicated to allow the proto-ribosome to bring two (non-coded)
substrates into proximity. Careful analyses of the tertiary inter-
actions reveal an evolutionary pathway of expansion of this proto-
ribosome, giving rise to 23S RNA.

Recent studies shed light on the role of two proteins previously 
implicated in peptidyl transfer. In bacteria, the amino terminus of
L27 could be crosslinked to the 3’ end of both A- and P-site tRNAs,
showing it was part of the PTC. Deletion or N-terminal trun-
cation of L27 results in reduced peptidyl-transferase activity, and
computer simulations suggest the role of L27 is to aid binding of
aminoacyl-tRNA. In addition, deletion of L16 was shown to cause a
deficiency in A-site tRNA binding and the rate of peptidyl transfer.
Recent structures show that the N-terminal tail of L27 is
ordered in the PTC where it interacts with the tRNA substrates
(Fig. 4b), and that L16 becomes ordered owing to its interactions 
with the acceptor arm of A-site tRNA, rationalizing these findings. 
Thus, some proteins seem to aid the RNA components that primarily 
facilitate the peptidyl-transfer activity of the ribosome. 
Translocation: the formation of hybrid states. With peptide-bond 
formation, the nascent peptide chain is transferred to the A-site tRNA 
leaving a deacylated tRNA in the P site. Before the next round of elonga- 
tion, the tRNAs and mRNA need to move relative to the ribosome. 
During translocation the mRNA shifts by precisely one codon, except 
when either errors or programmed frameshifts occur. The tRNAs must 
also translocate from the A and P sites to the P and E sites, requiring a 
movement as large as 50 Å for the 3’ end of the P-site tRNA.
Chemical footprinting showed that movements of the tRNAs 
occurred first with respect to the 50S subunit. P/E and A/P hybrid 
tRNA states form spontaneously after peptide-bond formation, and 
only after the addition of the GTPase elongation factor G (EF-G) did 
movement occur with respect to the 30S subunit (Fig. 5). The
hybrid states were visualized by careful sorting of ribosomal com-
plexes. The ribosomal subunits have rotated by ~6° relative to
each other (http://www.nature.com/nature/journal/v406/n6793/
extref/406318ail.mov), and this ‘ratcheted’ ribosome contains both 
A/P and P/E tRNAs. Single-molecule FRET studies show that 
although the ribosome is initially in the unratcheted state, it oscillates 
between the unratcheted and ratcheted states after peptidyl transfer,
until EF-G binding stabilizes the latter. Consistent with a link between
the ratcheted state of the ribosome and translocation, FRET
experiments have shown that viomycin, an antibiotic that inhibits
translocation, traps the ribosome in a ratcheted state indistinguishable
by FRET from that obtained when EF-G is bound. Furthermore,
FRET measurements show that concomitantly with the ratcheting 
of the subunits, the L1 stalk moves to interact with the newly deacy- 
lated P-site tRNA, as would be expected if it moves into a P/E hybrid 
state. The formation of hybrid tRNA states is ordered, with the P/E
tRNA state formed first, followed by the A/P state.
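
The order of events described above can be summarized as a small state sequence. The following sketch is purely illustrative bookkeeping, not a kinetic model: it writes each tRNA position as 30S site/50S site, assumes the conventional labels A/A, P/P and E/E for the classical (non-hybrid) states, and uses hypothetical names (`TrnaPosition`, `translocation_steps`) that do not come from the review.

```python
from typing import List, NamedTuple, Tuple


class TrnaPosition(NamedTuple):
    """Binding site of one tRNA on each subunit, written 30S/50S."""
    site_30s: str
    site_50s: str

    def label(self) -> str:
        return f"{self.site_30s}/{self.site_50s}"


# Each site hands its tRNA on to the next one: A -> P -> E.
NEXT_SITE = {"A": "P", "P": "E"}


def advance_on_50s(trna: TrnaPosition) -> TrnaPosition:
    """Move the tRNA one site along the 50S subunit only."""
    return TrnaPosition(trna.site_30s, NEXT_SITE[trna.site_50s])


def advance_on_30s(trna: TrnaPosition) -> TrnaPosition:
    """Move the tRNA one site along the 30S subunit only."""
    return TrnaPosition(NEXT_SITE[trna.site_30s], trna.site_50s)


def translocation_steps() -> List[Tuple[str, str]]:
    """Return (peptidyl-tRNA, deacylated tRNA) labels after each step."""
    peptidyl = TrnaPosition("A", "A")    # now carries the nascent chain
    deacylated = TrnaPosition("P", "P")  # has just donated the chain
    steps = [(peptidyl.label(), deacylated.label())]

    # Spontaneous movement relative to the 50S subunit: P/E forms first...
    deacylated = advance_on_50s(deacylated)
    steps.append((peptidyl.label(), deacylated.label()))
    # ...followed by the A/P hybrid state.
    peptidyl = advance_on_50s(peptidyl)
    steps.append((peptidyl.label(), deacylated.label()))

    # EF-G-dependent movement relative to the 30S subunit.
    peptidyl = advance_on_30s(peptidyl)
    deacylated = advance_on_30s(deacylated)
    steps.append((peptidyl.label(), deacylated.label()))
    return steps


for peptidyl_label, deacylated_label in translocation_steps():
    print(f"peptidyl-tRNA {peptidyl_label}   deacylated tRNA {deacylated_label}")
```

Separating the 50S and 30S moves into two functions mirrors the two-step picture in the text, in which only the second step requires EF-G.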
The role of the E site in translocation. The adoption of the P/E 
hybrid state also explains why the E site may be necessary. The E site 
is known to have evolved before the divergence of the three king- 
doms, as the interactions of the E-site tRNA with the 50S subunit in 
archaea and bacteria are similar. Because the E site binds deacy-
lated but not peptidyl-tRNA, it is able to trap a hybrid P/E tRNA as
soon as the P-site tRNA becomes deacylated, facilitating transloca- 
tion by the formation of hybrid states. This concept is supported by 
Figure 5 | EF-G catalysed translocation. a-e, After peptidyl transfer,
tRNAs shift spontaneously to the A/P and P/E states in a ratcheted ribosome 
(b), to which EF-G binds. After GTP hydrolysis and tRNA movement, 
ratcheting reverses (d) and EF-G dissociates (e). f, Ratcheting involves a 
direct kinetic evidence and the observation that tRNA modifica-
tions that affect E-site binding also affect translocation.

The crucial interactions between the ribosome and the terminal
adenine of the E-site tRNA require only the 23S rRNA. Thus, the E
site may have evolved before the evolution of proteins, such as trans-
lational factors that facilitate translocation by hybrid states.
Consistent with this theory, ribosomes with modifications in S12
or S13 can perform translation even in the absence of elongation
factors. These proteins are at the subunit interface, and their
modification presumably disrupts contacts between the two sub- 
units, facilitating their rotation relative to each other. 
The role of EF-G in translocation relative to the 30S subunit. The 
second step in translocation is the movement of tRNAs and mRNA with 
respect to the 30S subunit, which is catalysed by EF-G. It is generally 
accepted that GTP hydrolysis by EF-G precedes translocation.

CryoEM structures (for example, ref. 21) show that EF-G in the
GTP-bound form on the ribosome has a significantly altered con-
formation from that in the GDP or apo form in isolation, and binds
to the ratcheted state of the ribosome. A recent higher-resolution
structure shows that the switch I and II regions of the GTPase domain
become ordered on binding to the ribosome. Calorimetric studies
suggest that EF-G undergoes a conformational change on binding
GTP even before binding the ribosome, although full activation occurs
only after ribosomal binding. The sarcin-ricin loop is the ribosomal
element closest to the switch II region that is functionally important
for GTPase activation in both EF-G and EF-Tu.

Ribosomes depleted of the L7/L12 stalk of the 50S subunit can bind 
EF-Tu or EF-G, but cannot efficiently activate GTP hydrolysis by the 
factors. L7 is an N-terminally modified form of L12, and by the
association of its N-terminal domains exists as a tetramer in
Escherichia coli or a hexamer in other species. This multimer of
L12 binds a single copy of L10 to form a stalk that is fully or partially 
disordered in high-resolution structures of the ribosome. The tip of the 
stalk containing the C-terminal domain of L12 seems to be too far from 
the ribosomal GTPase centre to be involved directly in stimulating 
hydrolysis (Fig. 1). The structure of a hexamer of L12 complexed with 
L10 has been determined and modelled into the structure of the 50S 
subunit. An increased rate of initial binding of GTPase factors was
observed kinetically, which could be caused by several copies of the 
C-terminal domain of L12 effectively increasing the local concentra- 
tion of the binding sites. Because the N- and C-terminal domains of 
L12 are connected by a flexible linker, the latter could move with the 
factor close to the sarcin-ricin loop.
rotation of the 30S subunit by approximately 6 degrees. See text for details.
Note that transitions a-to-b, and c-to-d could be divided into sub-steps. No 
structure exists for c, so a domain movement was modelled to prevent EF-G 
and A/P tRNA from clashing. 
The L11 region is in close proximity to the GTPase centre. This
region also binds the antibiotics thiostrepton and micrococcin, which 
inhibit and enhance GTPase activity, respectively. In the structure of 
the 50S subunit bound with micrococcin, density for the C-terminal 
domain of L12 is observed adjacent to L11 and would be positioned to 
interact with EF-G. Thus micrococcin could stabilize the binding of
the C-terminal domain of L12 to the GTPase factor. 

How GTP hydrolysis leads to movement of mRNA and tRNAs and 
resets the ratcheted ribosome to its canonical form is still unclear. 
Presumably, a rearrangement in the ribosome induced by GTP 
hydrolysis allows movement of the tRNAs and mRNA. Direct
monitoring of mRNA showed that mRNA and tRNA movements
occur at the same rate, and thus are directly coupled. In the GDP
form, the conformation of domain IV of EF-G places it in the A site of
the 30S subunit. Thus, GTP hydrolysis may drive translocation
by preventing the reverse movement of tRNA and mRNA. GTP
hydrolysis may also allow the ribosome to act as a helicase and
unwind the secondary structures formed in mRNA.

As this review was going to press, the crystal structure of EF-G:GDP
trapped in the post-translocational state on the ribosome
by fusidic acid was determined. The structure shows that
domain IV makes extensive interactions with the minor groove of
the codon-anticodon base pairs at the P site, but not with the A site
codon. Fusidic acid seems to trap EF-G in a conformation between
that of the GTP and GDP states, and the binding of EF-G in this state
stabilizes the L10-L7/L12 stalk as well as (indirectly) the L1 stalk.


Termination of translation 


The elongation cycle continues until an mRNA stop codon moves 
into the A site, signalling the end of the coding sequence. A class I 
release factor recognizes the stop codon and cleaves the nascent poly- 
peptide chain from the P-site tRNA, resulting in the release of the 
newly synthesized protein from the ribosome. In bacteria, there are 
two class I release factors, RF1 and RF2. Whereas both factors recog-
nize the UAA stop codon, UAG and UGA are only recognized by RF1
and RF2, respectively. In eukaryotes, a single eRF1 that is unrelated to
RF1 or RF2 (refs 106, 107) recognizes all three stop codons.
Tripeptide motifs PXT in RF1 and SPF in RF2 confer specificity for
the codons UAG or UGA, whereas a universally conserved GGQ
motif is implicated in peptide hydrolysis by release factors.
Unlike the extended structure of eRF1 (ref. 107), the crystal struc-
ture of isolated RF2 is compact, with the GGQ and SPF motifs only
23 Å apart. However, low-resolution structures showed that when
bound to the ribosome, release factors were in an open form and
domain 3, containing the GGQ motif, inserted into the PTC

Figure 6 | Termination of translation by class I release factors. a, Overview
of class I release-factor binding to the 70S ribosome. The view shows
RF2; the GGQ motif implicated in catalysis of peptide release at the PTC and
the SPF motif implicated in stop-codon recognition at the decoding centre
(Fig. 6a). The high-resolution crystal form of the ribosome was used
to solve structures of class I release factors bound to the bacterial
ribosome, considerably advancing our understanding of the
function of RF1 and RF2. 
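
The stop-codon specificity of the bacterial class I release factors stated above (both recognize UAA; UAG only RF1; UGA only RF2) amounts to a small lookup table. The sketch below encodes it and scans an mRNA in frame for the first stop codon. The function names and the example sequence are hypothetical, and the scan ignores everything else the ribosome does, including the sequence-context effects on release-factor efficiency.

```python
from typing import List, Optional, Tuple

# Stop codons recognized by each bacterial class I release factor.
RELEASE_FACTOR_CODONS = {
    "RF1": {"UAA", "UAG"},
    "RF2": {"UAA", "UGA"},
}


def release_factors_for(codon: str) -> List[str]:
    """Return the class I release factors that recognize this codon."""
    return sorted(
        factor
        for factor, codons in RELEASE_FACTOR_CODONS.items()
        if codon in codons
    )


def first_stop(mrna: str) -> Optional[Tuple[int, List[str]]]:
    """Scan in frame from position 0; return (index, factors) of the first stop."""
    for start in range(0, len(mrna) - 2, 3):
        factors = release_factors_for(mrna[start:start + 3])
        if factors:
            return start, factors
    return None


# Hypothetical three-codon message: AUG GCU UGA.
print(first_stop("AUGGCUUGA"))
```

A sense codon yields an empty list, so the scan continues one codon at a time, in the same way that elongation continues until a stop codon enters the A site.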

Recognition of the stop codon. Release-factor binding causes the 
conserved decoding centre bases G530, A1492 and A1493 to change 
conformation and form crucial interactions (Fig. 6b). The
changes are distinct from those during decoding of tRNA in which
the bases monitor base-pairing geometry (Fig. 3c-e), in a con-
formation incompatible with the binding of release factors.
Instead, A1493 stacks on A1913 of 23S RNA, forming a new contact 
between the two subunits, and G530 stacks onto the third stop codon 
base. 

Although the structures can rationalize the specificity of RF1 and
RF2 for their respective stop codons, the tripeptide motifs implicated
in conferring specificity of RF1 or RF2 (ref. 108) make only limited
interactions with the stop codon (Fig. 6b), so that their role is
still unclear. Bases after the stop codon affect release-factor efficiency
(for example, ref. 117), and a single mutation in RF2 distant from the
stop codon allows it to recognize all three stop codons.
Furthermore, it has been shown that a mismatch in the P site (after
a near-cognate tRNA has been accepted and translocated) leads to
release factor recognition of sense codons with increased efficiency.
Thus, release factor function may involve a subtle balance between
the energetics of binding and conformational changes, similar to that
during decoding by tRNA. After stop-codon recognition, peptide
release is triggered, but the mechanism of signal transduction is
unclear.
Catalysis of peptide release. The conserved GGQ motif is positioned
in the PTC in a conformation only allowed because of its glycines
(Fig. 6c), explaining the drastic reduction in the activity of
release factors when they are mutated. Furthermore, after
release-factor binding U2585 shifts to expose the ester bond between
the nascent peptide and P-site tRNA, as is observed upon binding of
A-site tRNA or its analogues. This shift has been proposed to be
catalytically important for both peptidyl transfer and release, as in
both cases it exposes the ester bond to attack by a nucleophile.
Further supporting this theory, a variety of nucleophiles are effective
to varying degrees during catalysis of peptide release by release
factors.

The glutamine in the GGQ motif makes a hydrogen bond through 
its main-chain amide with the 3’ OH of A76 of deacylated P-site
tRNA, which represents the product state after catalysis and
release of the nascent peptide chain. One group has proposed product 
stabilization as part of the catalytic mechanism of release factors, 


are highlighted in red. b, RF2 in the decoding centre, with the SPF motif 
highlighted. c, The PTC of the ribosome showing the GGQ motif of RF2 and 
deacylated P-site tRNA. 
similar to certain proteases. Although the mutation of the gluta-
mine to alanine results in only a modest 5-10-fold reduction in the
catalytic rate, several other observations argue that it has a spe-
cific role in catalysis. The glutamine is universally conserved, which for
a glutamine is rarely explained by purely structural reasons. It is required for
viability in bacteria and in eukaryotes. Whereas the mutation
of the glutamine does not affect the rate of catalysis by other nucleo-
philes, it does specifically affect the rate for peptide release by water.
Finally, the side-chain amino group of the glutamine is methylated,
and the loss of methylation was shown to reduce the efficiency of
peptide release. Consistent with these data, one of the structural
studies proposed a model in which the glutamine side chain directly
coordinates a water molecule for nucleophilic attack. Structures of
the substrate and transition state complexes will address this question.
The role of RF3. The class II release factor RF3 accelerates the dis-
sociation of class I factors from the ribosome after peptide release.
The binding of RF3 in the GDP form to the ribosome-RF1/2 complex
is thought to induce RF3 to exchange GDP for GTP. The crystal
structure of RF3-GDP resembles EF-Tu in the GTP form. The
same study showed that the binding of RF3 in the GTP form to the
ribosome induces conformational changes likely to destabilize the
binding of class I release factors, thus leading to their dissociation
from the ribosome.

Recycling of ribosomes before reinitiation. After hydrolysis of GTP
by RF3, the factor dissociates from the ribosome, leaving mRNA and 
a deacylated tRNA in the P site. The ribosome must be recycled into 
subunits for a new round of protein synthesis to begin. In bacteria, an 
essential protein called ribosome recycling factor (RRF) works 
together with EF-G to carry out this process.

Chemical probing, cryoEM and crystallography all suggest similar 
interactions of RRF with the ribosome. However, the location
of RRF in these studies would be incompatible with a P-site tRNA in 
the 50S subunit, as it would clash with the tip of domain I of RRF. 
Therefore, a probable model is that RRF binds to a ribosome contain- 
ing a deacylated hybrid P/E tRNA. EF-G would then bind, similar to 
the way in which it binds the ribosome in a pre-translocation state. 
However, this view is complicated by studies suggesting that RRF can 
even act on ribosomes with a peptidyl-tRNA. CryoEM studies
on the 50S subunit with RRF alone and with both RRF and EF-G suggest the type
of changes that might occur before and after RRF activity, but it is
unclear if they represent any specific state of recycling. So far, there is 
no structure of the entire ribosome with both EF-G and RRF. 

GTP hydrolysis seems to be required to promote the separation of
subunits, yielding a 50S subunit and a complex of 30S, mRNA and
deacylated tRNA, which requires IF3 to dissociate. The action of IF3
to remove mRNA and tRNA from the 30S subunit is attractive 
because it couples the last step in protein synthesis to the first, by 
preparing the 30S subunit for a new round of initiation (Fig. 2). 


Conclusions 


In this review, we have focused on the main aspects of bacterial trans- 
lation that are common to the synthesis of all proteins. Although even 
this basic pathway is very complicated, translation involves many 
other features that have also been the subject of structural and func- 
tional studies in recent years. These include the rescue of stalled ribo- 
somes, programmed frameshifting, the interaction of the nascent 
peptide with the exit tunnel, the modification of the peptide as it 
emerges from the ribosome, its folding and its transport across or 
insertion into membranes, and the regulation of translation. 
Nevertheless, one can only look back in wonder at the rate of progress 
in the last decade in our understanding of many key aspects of the 
translation pathway. 

This progress is likely to continue unabated, with cryoEM now 
yielding increasingly high resolution and an increasing number of 
functional states becoming amenable to crystallographic studies. 
Two areas that would particularly benefit from high-resolution struc- 
tures are initiation and translocation. These, as well as other stages of 
translation, involve GTPase factors. The very recent crystal structures
of EF-Tu and EF-G represent the first high-resolution structures
of GTPase factors bound to the ribosome. By showing that such 
complexes are accessible to crystallography, they allow us to be 
optimistic that similar structures in other states will lead to an under- 
standing of how the ribosome specifically activates GTP hydrolysis by 
these factors at precise stages that differ for each GTPase. In addition 
to structural studies, increasingly sophisticated biochemical methods 
such as single-molecule studies will help to dissect the various steps of 
complicated processes. Although the structures of key states of the 
ribosome will be welcome, an understanding of how the ribosome 
proceeds from one state to the next will be aided by molecular 
dynamics, which is now able to tackle larger and more complex 
problems as a result of advances in computing and methodology. 
Finally, the extremely complicated field of eukaryotic translation, 
especially initiation, is sure to be increasingly targeted by biophysical 
and biochemical techniques. 
Genome evolution and adaptation in a
long-term experiment with Escherichia coli 


Jeffrey E. Barrick, Dong Su Yu, Sung Ho Yoon, Haeyoung Jeong, Tae Kwang Oh, Dominique Schneider,
Richard E. Lenski & Jihyun F. Kim


The relationship between rates of genomic evolution and organismal adaptation remains uncertain, despite considerable
interest. The feasibility of obtaining genome sequences from experimentally evolving populations offers the opportunity to
investigate this relationship with new precision. Here we sequence genomes sampled through 40,000 generations from a
laboratory population of Escherichia coli. Although adaptation decelerated sharply, genomic evolution was nearly constant
for 20,000 generations. Such clock-like regularity is usually viewed as the signature of neutral evolution, but several lines of
evidence indicate that almost all of these mutations were beneficial. This same population later evolved an elevated mutation
rate and accumulated hundreds of additional mutations dominated by a neutral signature. Thus, the coupling between
genomic and adaptive evolution is complex and can be counterintuitive even in a constant environment. In particular,
beneficial substitutions were surprisingly uniform over time, whereas neutral substitutions were highly variable.


Adaptation has often been viewed as a gradual process. Darwin
wrote that “We see nothing of these slow changes in progress, until
the hand of time has marked the long lapse of ages...”. Theoretical
work in quantitative genetics supported this view by showing that
gradual adaptation would result from constant selection on many
mutations of small effect. However, an alternative model of evolu-
tion on rugged fitness landscapes challenged this perspective and,
later, empirical evidence was found for alternating periods of rapid
phenotypic evolution and stasis in some lineages. The causes of
variation in the rate of adaptation remain controversial and are
probably diverse. They may include changes in the environment, in
circumstances promoting or impeding gene flow, and in opportunities
for refinement following the origin of key innovations or the invasion
of new habitats, among other factors.

Genomic changes underlie evolutionary adaptation, but mutations— 
even those substituted (fixed) in evolving populations—are not neces- 
sarily beneficial. Variation in the rate of genomic evolution is also subject 
to many influences and complications. On the one hand, theory predicts 
that neutral mutations should accumulate by drift at a uniform rate, 
albeit stochastically, provided the mutation rate is constant. On the
other hand, rates of substitution of beneficial and deleterious mutations
depend on selection, and hence the environment, as well as on popu-
lation size and structure. Moreover, the relative proportions of sub-
stitutions that are neutral, deleterious and beneficial are usually difficult
to infer given imperfect knowledge of any organism’s genetics and eco-
logy, in the past as well as in the present.

Experiments with tractable model organisms evolving in con-
trolled laboratory environments minimize many of these complica-
tions and uncertainties. Moreover, new methods have made it
feasible to sequence complete genomes from evolution experiments
with bacteria. To date, such analyses have focused on finding the
mutations responsible for particular adaptations. However, the
application of comparative genome sequencing to experimental
evolution studies also offers the opportunity to address major con-
ceptual issues, including whether the dynamics of genomic and
adaptive evolution are coupled very tightly or only loosely.


Genome dynamics and adaptation 


To examine the tempo and mode of genomic evolution, we 
sequenced the genomes of E. coli clones sampled at generations
2,000, 5,000, 10,000, 15,000, 20,000 and 40,000 from an asexual 
population that evolved with glucose as a limiting nutrient for almost 
20 years as part of a long-term experiment. The complete sequence of 
the ancestral strain served as a reference for identifying mutations in 
the evolved clones, which we refer to by their generation abbrevia- 
tions 2K, 5K, 10K, 15K, 20K and 40K. 

Figure 1 shows all mutations identified in the evolved clones 
through 20,000 generations. The 45 mutations in the 20K clone 
include 29 single-nucleotide polymorphisms (SNPs) and 16 dele- 
tions, insertions and other polymorphisms (DIPs). Figure 2 shows 
that the number of mutational differences between the ancestral and 
evolved genomes accumulated in a near-linear fashion over this per- 
iod. Any deviation from linearity was not statistically significant 
based on randomization tests. 
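
As a rough illustration of what near-linear accumulation implies, the counts reported for the 20K clone can be turned into an average rate. The sketch below fixes the slope of a line through the origin using only the 20K total, which is a simplification: the model shown in Figure 2 is fitted to all sampled clones, whose individual counts are not given in this passage. The variable and function names are illustrative.

```python
# Counts reported in the text for the clone sampled at 20,000 generations.
SNPS_20K = 29
DIPS_20K = 16
GENERATIONS = 20_000

total_mutations = SNPS_20K + DIPS_20K

# Assumption: a uniform rate, with the slope fixed by the 20K clone alone.
rate_per_generation = total_mutations / GENERATIONS


def expected_mutations(generation: int) -> float:
    """Mutations expected at a given generation under the uniform-rate line."""
    return rate_per_generation * generation


print(f"total mutations at 20K: {total_mutations}")
print(f"mean rate: {rate_per_generation:.5f} mutations per generation")
print(f"about one mutation every {1 / rate_per_generation:.0f} generations")
```

Comparing the observed counts in the earlier clones with `expected_mutations` at the same generations is the kind of check that a test of deviation from linearity formalizes.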

The near-linearity of the trajectory for genomic evolution is rather 
surprising, given that such constancy is widely taken as a signature of 
neutral evolution, whereas the fitness trajectory for this popu-
lation shows profound adaptation that is strongly nonlinear. In
particular, the rate of fitness improvement decelerates over time 
(Fig. 2), which indicates that the rate of appearance of new beneficial 
mutations is declining, their average benefit is becoming smaller, or 
both. These effects, in turn, should cause the rate of genomic evolu- 
tion to decelerate. 

Figure 1 | Mutations found by sequencing genomes sampled between
2,000 and 20,000 generations from an evolution experiment with E. coli.
The outermost ring represents the genome sampled at 20,000 generations,
and labels all genes with SNP mutations in coding (black) and intergenic
(purple) regions, and those with DIP mutations including deletions (red),
insertions (green), insertion sequence (IS) element insertions (blue), and an
inversion between citC and gatZ (orange). Insertion sequences are
transposable elements present in bacterial genomes. The next four rings,
from outer to inner, show mutations present in genomes sampled at 15,000,
10,000, 5,000, and 2,000 generations. The innermost circle shows the
genome position and scale in megabase pairs (Mb). Mutations that are off
the line of descent to a genome sampled at 40,000 generations are capped
with a circle. Only one mutation (kup/insJ-5), a 1-base-pair (bp) insertion
near an IS150 element, shows an aberrant homoplastic distribution, being
present in clones 10K and 20K but not 15K. Precise molecular details for all
mutations are shown in Supplementary Tables 1 and 2.

Figure 2 | Rates of genomic evolution and fitness improvement. Blue circles
show the total number of genomic changes relative to the ancestor in each
sampled clone. The blue line represents a model where mutations accumulate
uniformly over time. The light blue curves define the 95% confidence interval
for this linear model. Green squares show the improvement of this
population's mean fitness relative to the ancestor over time, and the green
curve is a hyperbolic plus linear fit of this trajectory. Each fitness estimate is
the mean of three assays; most of the spread of points around the fitness
trajectory reflects statistical uncertainty inherent to the assays. The inset
shows the number of mutations in the 40,000-generation clone; the dashed
curve approximates the change in the time course of genomic evolution after a
mutator phenotype appeared by about generation 26,500.

To understand this point, consider a simple model of the substi-
tution of beneficial mutations in a clonal population of haploid
organisms. A beneficial mutation has an initial frequency of 1/N,
where N is population size, and it confers a selective advantage S over
its progenitor. Nevertheless, there is some probability that the
mutant is lost by drift while it is rare. Given large N, small S and a
Poisson distribution of offspring, a beneficial mutant has a prob-
ability of escaping extinction of ~2S. If the mutant survives, it takes
t ≈ log₂(0.5N)/S cell generations to increase to 50% frequency. These
dynamics thus have two phases. In the first, a population waits for the
appearance of a beneficial mutation that avoids extinction by drift
with an expected waiting time of θ ≈ 1/(2SNv), where v is the bene-
ficial mutation rate. In the second phase, the mutant spreads by
selection, becoming the new majority type after t generations.
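The three quantities in this model can be written out directly. The sketch below is only an illustration of the formulas above (escape probability ~2S, sweep time log₂(0.5N)/S and waiting time 1/(2SNv)); the function names and the parameter values for N, S and v are hypothetical and are not estimates from the experiment.

```python
import math


def escape_probability(s):
    """Approximate chance (~2S) that a new beneficial mutant survives drift."""
    return 2.0 * s


def sweep_time(n, s):
    """Generations for a surviving mutant to grow from 1/N to 50% frequency."""
    return math.log2(0.5 * n) / s


def waiting_time(n, s, v):
    """Expected generations before a drift-escaping beneficial mutation appears."""
    return 1.0 / (2.0 * s * n * v)


# Hypothetical parameter values, chosen only to exercise the formulas.
N, S, V = 1.0e7, 0.1, 1.0e-9
print(f"escape probability: {escape_probability(S):.2f}")
print(f"waiting time:       {waiting_time(N, S, V):.0f} generations")
print(f"sweep time:         {sweep_time(N, S):.0f} generations")
```

Both phases scale as 1/S, which is the property used in the third scenario discussed next.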

We can explore the relationship between rates of adaptation and 
genomic evolution under three scenarios. In the first, the substitution 
of any beneficial mutation has no effect on either the selection coef- 
ficient, S, or the beneficial mutation rate, v. The rates of fitness 
improvement and genome evolution should therefore be constant 
over the long term. Under the second scenario, the number of pos- 
sible sites for beneficial mutations is finite, so that v declines with 
increasing prior substitutions. The expected wait for a beneficial 
mutation becomes progressively longer, and the trajectories for 
adaptation and genomic evolution should decelerate in parallel. In 
the third scenario, the advantage of new beneficial mutations declines 
as fitness increases. The waiting and sweep times are both inversely 
proportional to S, so the total expected time between substitutions is 
also inversely proportional to S. The rate of fitness gain will decelerate 
with the reduced rate of beneficial substitutions as well as their 
declining effects, although the trajectories may not be parallel. 
Under all three scenarios, this model thus predicts declining rates
of both adaptive and genomic evolution or, alternatively, no decel-
eration in either trajectory.
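The three scenarios can be compared with a small deterministic sketch that strings together expected waiting and sweep times. It is not the authors' model code: the multiplicative decay factors applied to S and v after each substitution, and all numerical values, are hypothetical choices made only to show the qualitative behaviour.

```python
import math


def substitution_times(n_subs, n, s0, v0, s_decay=1.0, v_decay=1.0):
    """Cumulative generations at which successive substitutions complete.

    After each substitution, S is multiplied by s_decay and v by v_decay
    (hypothetical knobs standing in for scenarios two and three).
    """
    times, now, s, v = [], 0.0, s0, v0
    for _ in range(n_subs):
        wait = 1.0 / (2.0 * s * n * v)   # phase 1: wait for a surviving mutant
        sweep = math.log2(0.5 * n) / s   # phase 2: rise to majority
        now += wait + sweep
        times.append(now)
        s *= s_decay
        v *= v_decay
    return times


def intervals(times):
    """Generations elapsed between consecutive substitutions."""
    return [b - a for a, b in zip([0.0] + times[:-1], times)]


N, S0, V0 = 1.0e7, 0.1, 1.0e-9  # hypothetical values
constant = intervals(substitution_times(5, N, S0, V0))
finite_sites = intervals(substitution_times(5, N, S0, V0, v_decay=0.5))
diminishing = intervals(substitution_times(5, N, S0, V0, s_decay=0.8))
for name, gaps in [("constant", constant),
                   ("finite sites", finite_sites),
                   ("diminishing S", diminishing)]:
    print(name, [round(g) for g in gaps])
```

In the first case the gaps between substitutions stay the same; in the other two they lengthen, so genomic evolution decelerates.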


Predominance of beneficial substitutions 


The simplest hypothesis that could explain the discrepancy between 
the nearly constant rate of genomic change and the sharply decel- 
erating fitness trajectory posits that only a small fraction of all sub-
stitutions are beneficial, whereas most are neutral or nearly so.
Accordingly, the beneficial substitutions would be concentrated in 
the early phase of rapid adaptation to the conditions of the experi- 
ment, but over time that initial burst would be swamped by the 
constant accumulation of neutral mutations by drift. However, four 
lines of evidence allow us to reject this explanation. 

First, under this drift hypothesis, one expects disproportionately 
more synonymous than non-synonymous mutations, because the 
former have no effect on protein sequence and thus are more likely 
to be neutral. In fact, all 26 point mutations we found in coding regions 
(22 in clone 20K, and 4 off the line of descent) are non-synonymous. 
The probability of observing no synonymous substitutions is only 
0.07% if the same base changes were distributed randomly in the cod- 
ing regions of the ancestral genome. 

Second, if mutations had spread by random drift, we would not 
expect to see mutations in the same genes in the other independently 
evolved populations of the long-term experiment, because only ~1% 
of the >4,000 genes in E. coli harbour mutations in the population 
studied here. By contrast, selection should target the same genes in the 
replicate lines because they started from the same ancestor and evolved 
in identical environments. Fourteen genes in which mutations were 
found in our study population have been sequenced in all the other 
populations after 20,000 generations. There is substantial parallelism, 
with three cases where all eleven other populations have substituted 
mutations in the same gene, nine additional genes with mutations in 
other lines, and only two cases where no other line has a mutation in 
the same gene (Table 1). In almost all cases, the evolved alleles differ 
between the populations, so accidental cross-contamination cannot 
explain these parallel changes. 

Third, under the drift hypothesis, we would expect many muta- 
tions in individual clones that did not become fixed in the population 
as a whole. However, almost all mutations in the earlier clones were 
present in clones from all subsequent generations. For example, four 
of the six mutations in clone 2K are present in all later clones, and all 
thirty-four mutations in clone 15K occur in clones 20K and 40K. 
Moreover, two of the thirteen mutations through 20K that are off 
the line of descent to the 40K clone occur in genes (pykF and nadR) 
where different mutations arose and were substituted later. Both of 
these genes also have substitutions in all of the other populations, so 
even these early unsuccessful alleles were probably beneficial, but 
were nonetheless eliminated because competing sub-lineages had 
even more beneficial mutations.

Fourth, strains with these mutations should have no fitness 
advantage under the neutral drift hypothesis. To date, isogenic
strains with ancestral and derived alleles have been constructed at
nine loci. In all but one case, the derived allele confers a significant
advantage in competition (Table 2). The exception (ompF) might
also be beneficial in combination with other mutations present in
the genetic background in which it evolved, especially because
parallel mutations arose in other populations (Table 1). By contrast,
another study found that none of 26 random insertion mutations
conferred a significant advantage in the same environment.


Other explanations for rate discordance 


Taken together, these four lines of evidence demonstrate that dis- 
cordance in rates of genomic and adaptive evolution in this experi- 
ment cannot be explained by assuming a preponderance of neutral 
substitutions. Another plausible explanation for the disparity is an 
ecological one. Fitness levels were measured, at all generations, in 
competition with the ancestor. In an evolution experiment with 
yeast, non-transitive ecological interactions gave rise to complex 
dynamics, such that the cumulative adaptation measured across suc- 
cessive episodes of selection was greater than that measured directly 
from start to finish. However, there is no significant discrepancy
between the fitness gains summed over shorter intervals and the
overall improvement measured from start to finish for the popu-
lation in our study, allowing us to reject this hypothesis.

Clonal interference occurs in asexual organisms when sub-lineages
with beneficial mutations are driven extinct by competition with
other sub-lineages bearing mutations that are even more bene-
ficial, and this process might contribute to the relatively constant
rate of genomic change. In particular, the most beneficial mutations
should dominate the early phase of evolution for large populations in
a new environment, but there are more potential mutations that
confer small advantages than large ones. Thus, the supply of
contending beneficial mutations may increase enough to sustain a
uniform rate of overall genomic change. It may also be relevant that
some early substitutions, which contributed the most to fitness
improvement, involve global regulatory functions including the
stringent response and DNA supercoiling. These mutations have
pleiotropic effects on the expression of many genes, and although 
these changes are beneficial on balance, some of their side effects are 
probably deleterious. These maladaptive side effects may introduce 
new opportunities for compensatory changes that restore appropri- 
ate expression of other genes and thereby further increase the supply 
of mutations conferring small advantages. 


Table 1 | Frequency of parallel mutations in 11 other independently evolved lines

Gene or region    Function                          Parallel mutations (%)    Source
nadR              Transcriptional regulator         100                       Ref. 42
pykF              Pyruvate kinase                   100                       Ref. 42
rbs operon        Ribose catabolism                 100                       Ref. 43
malT              Transcriptional regulator         64                        Ref. 44
spoT              Stringent response regulator      64                        Ref. 31
mrdA              Cell-wall biosynthesis            45                        Ref. 42
infB              Translation initiation factor 2   45*                       This study
fis               Nucleoid-associated protein       27                        E. Crozat, D.S., unpublished
topA              DNA topoisomerase I               27                        E. Crozat, D.S., unpublished
pcnB              Poly(A) polymerase                27                        This study
ompF              Outer-membrane porin              18*                       This study
rpsD              30S ribosomal protein             18*                       This study
rpsM              30S ribosomal protein             0                         This study
glmU promoter     Cell-wall biosynthesis            0                         M. Stanek, R.E.L., unpublished

* In addition to populations with substitutions, one or more others were polymorphic.


Table 2 | Tests of fitness effect in competition between isogenic constructs

Gene or region    Fitness effect (%)    Significance    Source
topA              13.3                  [illegible]     Ref. 32
pykF*             11.1                  [illegible]     D.S., R.E.L., unpublished
spoT              9.4                   [illegible]     Ref. 31
nadR†             8.1                   [illegible]     D.S., R.E.L., unpublished
glmU promoter     4.9                   [illegible]     M. Stanek, T. Cooper, R.E.L., unpublished
fis               2.9                   [illegible]     Ref. 32
rbs operon†       2.1                   [illegible]     Ref. 43
malT              0.4                   [illegible]     Ref. 44
ompF‡             −0.7                  [illegible]     D.S., R.E.L., unpublished

* For this mutation, isogenic constructs were made by replacing the evolved allele with the ancestral allele in the evolved genetic background. For all other mutations, isogenic constructs were made in the ancestral background.
† In these two cases, artificial deletions of the genes were constructed in the ancestral background and the fitness effects of those deletions are reported.
‡ The deleterious effect of this mutation could indicate that it hitchhiked to high frequency. Alternatively, its fitness effect was tested only in the ancestral background, and it might be beneficial in association with one or more other evolved alleles.
** P < 0.01; *** P < 0.001. All significance levels are based on multiple independent competition assays.


Emergence of a hypermutable phenotype


Several of the long-term populations evolved mutator phenotypes by
20,000 generations, but the population in our study retained the low
ancestral mutation rate to at least that time point. In later genera-
tions, however, this population exhibited a greatly elevated rate of
genomic evolution (Fig. 2 inset). The 40K genome contains 627
SNP and 26 DIP mutations (Supplementary Tables 3 and 4). As a
consequence of the DIP mutations (including six new insertions of
IS150, three of IS186 and one of IS1) the genome size of the 40K clone
is 4.57 × 10⁶ bp, representing a reduction of 1.2% from the ancestor.
Of particular interest is a 1-bp insertion causing a frameshift mutation
in the mutT gene. Defects in mutT specifically cause A·T→C·G trans-
versions, and 92.3% (553 of 599) of the new point mutations at 40K
have this signature, significantly higher than the 23.5% (8 of 34)
frequency among earlier mutations (one-tailed Fisher's exact test,
P = 3 × 10⁻²⁰).
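The one-tailed Fisher's exact test on these counts can be reproduced from the hypergeometric distribution using only the standard library. The sketch below assumes the 2 × 2 table is arranged as late mutations (553 with the mutT signature, 46 without) against earlier mutations (8 with, 26 without); the function name is illustrative and this is not the authors' script.

```python
from math import comb


def fisher_one_tailed(a, b, c, d):
    """Upper-tail probability P(X >= a) for the table [[a, b], [c, d]].

    X is the count in the top-left cell when all margins are held fixed,
    so it follows a hypergeometric distribution.
    """
    row1, col1, total = a + b, a + c, a + b + c + d
    denom = comb(total, row1)
    upper = min(row1, col1)
    tail = sum(comb(col1, k) * comb(total - col1, row1 - k)
               for k in range(a, upper + 1))
    return tail / denom


# Rows: new mutations at 40K, earlier mutations.
# Columns: A·T→C·G signature, all other point mutations.
p_value = fisher_one_tailed(553, 46, 8, 26)
print(f"one-tailed P = {p_value:.1e}")
```

Exact integer arithmetic is used for the binomial coefficients, so the very small tail probability is not lost to floating-point underflow before the final division.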

We sequenced the site of the mutT frameshift in clones from other 
generations to determine the time-course of its fixation in the popu- 
lation. The mutation occurs in 0 of 3 clones tested at generations 
20,000, 25,000, 25,500 and 26,000; 1 of 3 clones at generation 26,500; 
2 of 3 clones at generations 27,000, 27,500, 28,000 and 28,500; and 3 
of 3 clones at generations 29,000, 29,500, 30,000, 35,000 and 40,000. 
Thus, the mutT mutant appeared by generation 26,500 and soon 
dominated the population. Luria–Delbrück fluctuation tests
indicate that the mutation rate to nalidixic acid resistance increased 
about 50- to 100-fold in later generations. 

The large number of new SNP mutations at 40K is presumably the 
result of drift coupled with the elevated mutation rate. Unlike the 
SNP mutations occurring before 20,000 generations, only a small 
fraction of these new mutations are likely to be beneficial. To test 
this prediction, we examine below the proportion of synonymous 
mutations after the mutator phenotype evolved to determine if it is 
consistent with a random distribution across sites. 


Synonymous changes and mutation rates 


The fact that no synonymous mutations fixed in the first 20,000 
generations is consistent with the low point-mutation rate in E. coli 
and population-genetic theory if those mutations are selectively 
neutral. According to theory, the expected rate of neutral substitu-
tions equals the rate of neutral mutations. We calculate an upper
bound for the mutation rate from the Poisson distribution, which
specifies a 5% chance that no synonymous substitutions would occur
even if three were expected. That upper bound corresponds to a
mutation rate of 1.6 × 10⁻¹⁰ per bp per generation given 20,000
generations, a genome length of 4.63 × 10⁶ bp, and the fact that
20.4% of all possible point mutations are synonymous. This inferred
rate lies between earlier estimates from mutation studies and com-
parative analyses of E. coli and Salmonella enterica.
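The arithmetic behind this upper bound is short enough to check directly. The following sketch uses only the figures quoted above; the function names are illustrative.

```python
import math


def poisson_mean_for_zero_probability(p_zero):
    """Poisson mean at which the chance of observing zero events is p_zero."""
    return -math.log(p_zero)


def per_bp_rate(n_events, generations, genome_bp, fraction_synonymous):
    """Point-mutation rate per bp per generation implied by neutral counts."""
    return n_events / (generations * genome_bp * fraction_synonymous)


# A 5% chance of seeing no synonymous substitutions corresponds to
# an expected count of about three.
expected_max = poisson_mean_for_zero_probability(0.05)
upper_bound = per_bp_rate(expected_max, 20_000, 4.63e6, 0.204)
print(f"expected count at the bound: {expected_max:.2f}")
print(f"upper bound on mutation rate: {upper_bound:.2e} per bp per generation")
```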

In the 40K genome, by contrast, 13.9% (83 of 599) of the new base 
substitutions (those not in the 20K genome) are synonymous. This 
fraction is lower than would be expected if 20.4% of random sub- 
stitutions were synonymous. However, mutations in the 40K genome 
are highly skewed towards A·T→C·G transversions, which have a
lower probability of causing synonymous changes than other point
mutations. To reflect this mutational bias, we grouped point muta-
tions into two categories: mutT, either A→C or T→G; and non-
mutT, all other base substitutions. These categories have probabilities
of synonymous mutations of 11.3% and 22.1%, respectively, and they
are represented by 553 and 46 new mutations, respectively, in the 40K
genome. The observation of 83 synonymous substitutions is slightly
higher than the random expectation of 71.6 based on the sum of these
two binomial distributions, although the excess of synonymous
changes is small and only marginally significant at best (one-tailed
P = 0.105). The small excess of synonymous substitutions implies that
a high proportion of late-arising non-synonymous changes are also
neutral or nearly so under the conditions of the evolution experiment.
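A minimal sketch of the sum-of-two-binomials calculation follows. It assumes that each of the 553 mutT-type and 46 other mutations is independently synonymous with the rounded probabilities 11.3% and 22.1% quoted above; because those inputs are rounded, the expectation it returns can differ slightly from the 71.6 reported, and the tail probability should be read as approximate.

```python
from math import exp, lgamma, log


def binom_pmf(n, p):
    """Binomial probabilities P(X = k) for k = 0..n, computed in log space."""
    log_p, log_q = log(p), log(1.0 - p)
    return [
        exp(lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
            + k * log_p + (n - k) * log_q)
        for k in range(n + 1)
    ]


def tail_of_sum(n1, p1, n2, p2, threshold):
    """P(X + Y >= threshold) for independent binomials X and Y."""
    pmf1, pmf2 = binom_pmf(n1, p1), binom_pmf(n2, p2)
    total = 0.0
    for j, pj in enumerate(pmf2):
        need = max(threshold - j, 0)   # X must supply the remainder
        total += pj * sum(pmf1[need:])
    return total


expected_count = 553 * 0.113 + 46 * 0.221
p_tail = tail_of_sum(553, 0.113, 46, 0.221, 83)
print(f"expected synonymous substitutions: {expected_count:.1f}")
print(f"one-tailed P(count >= 83): {p_tail:.3f}")
```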

We can also use the number of synonymous substitutions to esti- 
mate the point mutation rate after the mutator phenotype evolved. 
The precise time of origin for the mutT subpopulation is unknown, 
but it was present in the 26,500-generation sample; we assume it 
arose at generation 25,000 for this estimation. As previously noted, 
neutral mutations should accumulate at a rate equal to their under- 
lying mutation rate. The lineage leading to the 40K clone accumu- 
lated all or almost all of its 83 synonymous substitutions after it 
became a mutator. Given roughly 15,000 generations, a final genome
length of 4.57 × 10⁶ bp, and the fact that only 11.3% of mutT point
mutations should produce synonymous changes, the 83 such instances
imply a point-mutation rate of 1.1 × 10⁻⁸ per bp per generation. This
rate is about 70-fold higher than the upper bound estimated before the 
mutator phenotype evolved. 
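The same neutral-accumulation arithmetic gives the post-mutator estimate and the fold change. The sketch below uses only numbers stated in the text, with the earlier upper bound taken as 1.6 × 10⁻¹⁰ per bp per generation; the function name is illustrative.

```python
def per_bp_rate(n_synonymous, generations, genome_bp, fraction_synonymous):
    """Point-mutation rate per bp per generation from synonymous counts."""
    return n_synonymous / (generations * genome_bp * fraction_synonymous)


# 83 synonymous substitutions over roughly 15,000 mutator generations.
mutator_rate = per_bp_rate(83, 15_000, 4.57e6, 0.113)
ancestral_upper_bound = 1.6e-10
fold_increase = mutator_rate / ancestral_upper_bound

print(f"mutator rate: {mutator_rate:.2e} per bp per generation")
print(f"fold increase over the earlier upper bound: {fold_increase:.0f}")
```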


Perspective and outlook 


Genome re-sequencing in the context of experimental evolution pro- 
vides new opportunities for quantifying evolutionary dynamics. We 
observed discordance between the rates of genomic change and fitness 
improvement during a 20-year experiment with E. coli in two respects.
First, mutations accumulated at a near-constant rate even as fitness 
gains decelerated over the first 20,000 generations. Second, the rate of 
genomic evolution accelerated markedly when a mutator lineage 
became established later. The fluid and complex coupling observed 
between the rates of genomic evolution and adaptation even in this 
simple system cautions against categorical interpretations about rates 
of genomic evolution in nature without specific knowledge of molecu- 
lar and population-genetic processes. Our results also call attention to 
new opportunities for population-genetic models to explore the long- 
term dynamic coupling between genome evolution and adaptation, 
including the effects of clonal interference, compensatory adaptation, 
and changing mutation rates. 


METHODS SUMMARY 

Evolution experiment. Twelve E. coli populations were propagated at 37 °C for
6,000 days in minimal medium supplemented with limiting glucose at 25 mg l⁻¹
by transferring 0.1 ml of culture into 9.9 ml of fresh medium each day. The
population designated Ara-1 is the focus of this study. Samples were stored
periodically at −80 °C and later revived for sequencing and phenotypic analyses.
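The link between the daily transfer regime and the generation counts used throughout can be sketched as follows. The calculation assumes that the population regrows to the same density after every transfer, so that each 100-fold dilution is followed by log₂(100) doublings; that assumption is ours for illustration and is not stated in this summary.

```python
import math

transfer_ml = 0.1       # culture carried over each day
fresh_ml = 9.9          # fresh medium each day
days = 6_000

dilution_factor = (transfer_ml + fresh_ml) / transfer_ml
generations_per_day = math.log2(dilution_factor)
total_generations = generations_per_day * days

print(f"dilution factor:     {dilution_factor:.0f}")
print(f"generations per day: {generations_per_day:.2f}")
print(f"total generations:   {total_generations:,.0f}")
```

Under this assumption the total comes out close to the 40,000 generations of the latest clone analysed.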
Genome re-sequencing. On the basis of the sequence of the ancestral strain
REL606 (GenBank accession number NC_012967.1), NimbleGen microarray-
based comparative genome sequencing was first used to screen for mutations in
the 2K and 20K clones. All six evolved clones were then sequenced to >50×
coverage using the Illumina 1G platform. Mutations were identified using
BRESEQ, a custom computational pipeline. Targeted sequencing was used to
confirm almost all mutations in the 20K and earlier clones, and to find parallel
mutations in the other long-term populations.

Mutation trajectory. We performed two randomization tests for deviations 
from linearity in the rate of genome evolution through 20,000 generations, 
one based on the cumulative distribution of mutations and the other on a 
time-weighted test statistic. Each test was performed using three data sets: the 
total number of mutations, only mutations on the line of descent, and only SNP 
mutations. None of the one-tailed tests was significant at P< 0.05. 
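A simplified version of the cumulative-distribution randomization test can be sketched as below. It redistributes the 45 mutations of the 20K clone uniformly over 20,000 generations and builds pointwise 95% intervals at each sampling time; this is a reduced illustration with far fewer simulations than the published analysis, it omits the stricter limits near the ends of the distribution, and it takes the 15K count as 34, following the statement that clone 15K carries thirty-four mutations.

```python
import random


def simulate_counts(n_mutations, sample_times, t_max, n_sims, seed=0):
    """Mutation counts at each sample time under uniform accumulation."""
    rng = random.Random(seed)
    sims = []
    for _ in range(n_sims):
        times = [rng.uniform(0, t_max) for _ in range(n_mutations)]
        sims.append([sum(t <= s for t in times) for s in sample_times])
    return sims


def pointwise_interval(sims, index, level=0.95):
    """Central interval of simulated counts at one sample time."""
    values = sorted(row[index] for row in sims)
    lo = values[int((1 - level) / 2 * len(values))]
    hi = values[int((1 + level) / 2 * len(values)) - 1]
    return lo, hi


sample_times = [2_000, 5_000, 10_000, 15_000, 20_000]
observed_total = [6, 15, 28, 34, 45]
sims = simulate_counts(45, sample_times, 20_000, n_sims=2_000, seed=1)
for i, (t, obs) in enumerate(zip(sample_times, observed_total)):
    lo, hi = pointwise_interval(sims, i)
    print(f"generation {t:>6}: observed {obs:>2}, 95% interval [{lo}, {hi}]")
```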

Fitness trajectory. Fitness levels were previously measured in competition assays
between the Ara-1 population samples and a genetically marked variant of the
ancestor. We fit these data to a hyperbolic plus linear model: w = at/(b + t) + ct,
where w is mean fitness, t is time, and a, b and c are free parameters.
The inclusion of the linear term c significantly improves the fit relative to a
hyperbolic-only (a, b) model (F₁,₁₇ = 7.715, P = 0.0129).
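The model and the reported significance level can be checked with a short sketch. Because the comparison adds a single parameter, the F statistic has 1 and 17 degrees of freedom, and its P-value equals the two-tailed P-value of a t statistic with 17 degrees of freedom at t = √F. The code integrates the t density numerically with Simpson's rule; the function names are illustrative, and no fitted parameter values are assumed.

```python
import math


def fitness_model(t, a, b, c):
    """Hyperbolic plus linear trajectory, w = a*t/(b + t) + c*t."""
    return a * t / (b + t) + c * t


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def f_test_p_value_1df(f_stat, df2, steps=2_000):
    """P-value for F with (1, df2) d.f., using F(1, df2) = t(df2) squared."""
    t = math.sqrt(f_stat)
    h = t / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df2) + t_pdf(t, df2)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df2)
    area = total * h / 3  # probability mass between 0 and t
    return 1.0 - 2.0 * area


p_value = f_test_p_value_1df(7.715, 17)
print(f"P = {p_value:.4f}")
```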


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature.


METHODS

Genomic DNA isolation. Clones 2K (strain REL1164A), 5K (REL2179A), 10K 
(REL4536A), 15K (REL7177A), 20K (REL8593A) and 40K (REL10938) from the 
Ara-1 population of the E. coli long-term evolution experiment were revived
from stocks kept at −80 °C in 15% glycerol by growth overnight in LB medium.
DNA was harvested and purified from several millilitres of each culture with the 
Qiagen Genomic-tip 100/G kit. 

NimbleGen comparative genome sequencing (CGS). Genomic DNA from the 
2K and 20K clones was sent to NimbleGen Systems for comparative genome 
sequencing. Tiling arrays with 29-mer probes optimized for length, melting 
temperature, and mismatch position based on the REL606 ancestral genome 
sequence were used to detect mutational differences between the ancestor
and each evolved clone. Data were visualized using SignalMap version 1.8. All 
putative SNP sites and candidates for DIP variations (deletions, insertions and 
inversions) were checked by capillary Sanger sequencing of PCR products amp- 
lified directly from the genome. The NimbleGen CGS approach did not find 
several mutations that had been previously discovered in evolved clones (see 
later), indicating that a high false-negative rate was an impediment to this 
approach. 

Illumina whole-genome shotgun (WGS) re-sequencing. Genomic DNA samples 
from all six Ara-1 clones were sequenced on a 1G Genome Analyzer (Illumina) by 
Macrogen. Standard procedures produced data sets of opposite-strand, mated 
36-bp read pairs with inserts averaging ~135 bp and calibrated base quality scores.
Mutations in each genome were then identified using BRESEQ, a pipeline imple-
mented in Perl for analysing bacterial WGS re-sequencing data (J. E. Barrick,
unpublished algorithm). The average coverage at positions with only unique read
alignments in each of the six sequenced genomes was between 55× and 65×. More
than 99.9% of the positions outside of repeated regions in each genome had at least
tenfold coverage. These levels are sufficient to ensure with great confidence that
SNP mutations at virtually all positions in the reference genome would be dis-
covered if present. For additional verification of mutations predicted by BRESEQ,
we also used the software MAQ (version 0.7.1), which predicted the same set of
base substitutions in each WGS re-sequencing data set. 

BRESEQ pipeline for mutation discovery. BRESEQ analyses gapped matches 
between each read sequence and the reference genome produced by MUMmer
(version 3.20) with minimal exact match and extension requirements of 14 bp. 
For each read, it first determines the set of best alignments that are not contained 
within other matches and that have fewer mismatches than other alignments 
with the same endpoints. Of these matches, those that do not contain at least 
three of the four possible DNA bases or where 80% or more of the length of the 
match is a single base are eliminated. These nearly homopolymeric matches are 
typically spurious read sequences. For remaining matches that have a unique 
best match in the reference genome, internal ambiguities in gapped align- 
ments are systematically re-aligned to consistent reference genome positions 
by using a Needleman–Wunsch algorithm. The ends of each alignment are
also trimmed whenever it is possible that mutations introducing small indels 
would be compatible with an alternative, equally valid alignment to the reference 
sequence. 
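The filter for nearly homopolymeric matches described above can be sketched as a small predicate. This is an illustration written in Python rather than the Perl of the actual pipeline; the function name and argument names are hypothetical, and only the two thresholds (at least three of the four bases, and less than 80% of the length from a single base) come from the text.

```python
def is_nearly_homopolymeric(match_seq, min_distinct_bases=3,
                            max_single_base_fraction=0.8):
    """Return True if a matched sequence should be discarded as spurious.

    A match is discarded when it contains fewer than min_distinct_bases of
    A, C, G and T, or when one base makes up max_single_base_fraction or
    more of its length.
    """
    seq = match_seq.upper()
    if not seq:
        return True
    counts = {base: seq.count(base) for base in "ACGT"}
    distinct = sum(1 for n in counts.values() if n > 0)
    top_fraction = max(counts.values()) / len(seq)
    return (distinct < min_distinct_bases
            or top_fraction >= max_single_base_fraction)


for example in ["AAAAAAAAAAAAAAAAAAAA", "ACACACACACAC",
                "AAAAAAAACG", "ACGTTGCAAGCT"]:
    verdict = "discard" if is_nearly_homopolymeric(example) else "keep"
    print(f"{example:<22} {verdict}")
```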

On the basis of unique alignments, BRESEQ predicts base changes and short 
indels from the calibrated base quality scores that support and contradict each 
candidate mutation. New sequence junctions (such as those produced by dele- 
tions) are predicted from hybrid reads consisting of two regions with best 
matches to discontinuous sites in the reference genome. The junction prediction 
procedure involves first assembling a list of candidate junction sequences com- 
patible with these matches, then re-querying hybrid reads against these candi- 
dates with MUMmer, and finally predicting consensus junctions from reads that 
match the new candidates better than they match any portion of the original 
genome. Additionally, deletions too large to be identified as gaps in individual 
read alignments are predicted by taking all reference positions with zero unique 
coverage and propagating outward until the unique coverage at a position 
exceeds an arbitrary cutoff. 
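The coverage-based rule for large deletions can be illustrated with a toy example. The sketch below seeds a candidate at each position with zero unique coverage and extends it in both directions until the coverage exceeds a cutoff; the function name, the list-based coverage representation and the cutoff value are assumptions made for illustration, not details of the pipeline itself.

```python
def predict_large_deletions(coverage, cutoff):
    """Return inclusive (start, end) index pairs of candidate deletions.

    coverage is a list of unique read coverage per reference position.
    """
    deletions = []
    i, n = 0, len(coverage)
    while i < n:
        if coverage[i] != 0:
            i += 1
            continue
        start = end = i
        # Propagate outward while coverage stays at or below the cutoff.
        while start > 0 and coverage[start - 1] <= cutoff:
            start -= 1
        while end + 1 < n and coverage[end + 1] <= cutoff:
            end += 1
        deletions.append((start, end))
        i = end + 1
    return deletions


toy_coverage = [30, 28, 5, 0, 0, 0, 2, 25, 31, 0, 40]
print(predict_large_deletions(toy_coverage, cutoff=10))
```

In practice the boundaries found this way would still need to be refined against junction-spanning reads, as the text describes for new sequence junctions.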

BRESEQ generates HTML output files with the genomic contexts of putative 
mutations annotated using BioPerl. These tables are linked to alignments of
read sequences so that information supporting and contradicting each predicted 
mutation can be examined in more detail by the user. Additionally, coverage 
histograms are generated for the entire genome and for each large deletion with 
the R statistics package. These output files were used to determine the precise
extent of deletions and the locations of new sequence junctions. BRESEQ per-
forms a reference-based comparison, rather than de novo assembly, and therefore
this procedure may not reliably detect point mutations, indels and genomic 
rearrangements involving repeated sequences that occur at several locations in 
the genome, or short tandem repeats that approach or exceed the 36-bp size of 
the input reads.

Mutation identification. Supplementary Table 1 shows the precise genomic
locations and other details for all 34 SNP mutations found in the genomes of 
the Ara-1 2K, 5K, 10K, 15K, and 20K clones. Supplementary Table 2 provides 
details for the 21 DIP mutations identified in these genomes, which include a 
large inversion, three 1-bp insertions, three 1-bp deletions, four more extensive 
deletions, and ten IS element insertions. 

The following seven SNP mutations were discovered before genome re- 
sequencing: mrdA (ref. 42), ompF/asnS (D. Schneider, unpublished), topA 
(ref. 32), yhdG/fis (ref. 32), malT (ref. 44), spoT (ref. 31) and 20K nadR 
(ref. 42). Four of these known SNP mutations (mrdA, yhdG/fis, spoT and 20K 
nadR) were found independently using NimbleGen CGS, but the other three 
were not. Of the remaining 23 SNP mutations present in either the 2K or 20K 
genomes, NimbleGen CGS identified all except five (insL-2/lon, nagC, 2K pykF, 
insB-15 and hypF). By contrast, analysis of Illumina WGS data identified all seven
of the known mutations and 33 of the 34 total SNP mutations reported. The final
SNP mutation (insB-15) is within a multicopy IS element and was found in this
study during additional Sanger sequencing. We also verified that the kup/insJ-5
1-bp insertion on the border of an IS150 element shows an aberrant homoplastic
distribution, being present in clones 10K and 20K but not 15K, with additional 
targeted sequencing. 

Five DIP mutations were discovered before genome re-sequencing:
Δ(nmpC-ECB_00513) (ref. 50), inv(citC-gatZ) (ref. 50), pykF::IS150 (ref. 50),
glmU/atpC (M. Stanek and R.E.L., unpublished), and Δ(kup-yieO) (ref. 43). Four
of these mutations were found independently using NimbleGen CGS, with the
exception being the 1-bp insertion in the glmU/atpC intergenic region. Five of
the twelve other DIP mutations present in the 2K and 20K genomes were first
discovered using NimbleGen CGS (ynjI::IS150, Δ(manB-cpsG), kpsD::IS150, ΔgltB,
fimA::IS186), and the 1-bp insertion in the kup/insJ-5 intergenic region was
found during additional Sanger sequencing. Analysis of Illumina WGS data 
identified 20 of the 21 DIP mutations reported, all except inv(citC-gatZ). New 
IS-insertions predicted by WGS re-sequencing were confirmed as size changes in 
PCR-amplified fragments. 

We identified a total of 627 SNP and 26 DIP mutations in the 40K clone 
genome (Supplementary Tables 3 and 4). In addition to the 15 DIP mutations 
in the 20K clone that are on the line of descent, the 11 other DIP mutations 
include four IS-insertions, two 1-bp insertions, one 6-bp insertion, one 1-bp 
deletion, one 61-bp deletion, and two larger deletions of roughly 7 and 22 kb. 
Only two mutations that are present in the 20K clone (tdcR/yhaB and nrdE) are 
not found at 40K. One of the additional IS-insertions in the 40K clone is an IS186
element in the nupC/yfeA intergenic region at the exact site where one occurs in 
the 5K clone. Because this mutation is missing in all of the sequenced clones from 
intervening generations, this insertion is the second example of a homoplastic 
change that evidently originated independently in two sub-lineages. Also, one of 
the new 1-bp insertions adds another G directly adjacent to the previous kup/ 
insJ-5 1-bp insertion. We did not further confirm most of the predicted muta- 
tions in the 40K genome owing to the large number, but their quality is on par 
with the mutations that were correctly predicted in the earlier clones from WGS 
data. 

Parallel mutations. We PCR-amplified and sequenced the infB, pcnB, ompF, 
rpsD and rpsM genes of three clones isolated at 20,000 generations from each of 
the 11 other experimental populations. Mutations were counted as parallel 
changes if they were within the protein coding sequence or upstream promoter 
elements, but not if they were downstream of the reading frame. 

Fluctuation tests. We performed Luria–Delbrück fluctuation tests to confirm
that the Ara-1 population evolved an elevated mutation rate. Bacteria were
revived from frozen stocks by growth overnight in LB medium. After dilution
and 24 h of re-growth in Davis minimal medium supplemented with 25 mg l⁻¹
glucose, we inoculated 24 replicate 10-ml cultures of Davis minimal medium
with 250 mg l⁻¹ glucose with 100–1,000 cells. After 24 h of growth to stationary
phase, these cultures were concentrated by centrifugation and plated on LB agar
containing 20 µg ml⁻¹ nalidixic acid. The mutation rates to resistance of the
mixed populations archived at 20,000, 30,000 and 40,000 generations were esti-
mated as 5.8 × 10⁻¹⁰, 1.7 × 10⁻⁸ and 6.3 × 10⁻⁹ per cell division by the max-
imum likelihood method using a custom Perl script. These values are roughly 4,
120 and 45 times the mutation rate of 1.4 × 10⁻¹⁰ per cell division estimated in
the same experimental block for the ancestral strain.
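The fold changes quoted here follow from simple ratios. The sketch below assumes the rate exponents are −10, −8 and −9 for the 20,000-, 30,000- and 40,000-generation samples and −10 for the ancestor, which is the reading consistent with the stated multiples of roughly 4, 120 and 45; it does not reproduce the maximum likelihood estimation itself.

```python
ancestral_rate = 1.4e-10  # per cell division

# Mutation rates to nalidixic acid resistance, keyed by generation.
evolved_rates = {20_000: 5.8e-10, 30_000: 1.7e-8, 40_000: 6.3e-9}

fold_change = {gen: rate / ancestral_rate for gen, rate in evolved_rates.items()}
for gen, fold in fold_change.items():
    print(f"generation {gen:>6}: about {fold:.0f}-fold the ancestral rate")
```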

Tests of constant rate of genomic evolution. We examined three data sets 
consisting of the total number of SNP and DIP mutations in the 2K, 5K, 10K, 
15K and 20K genomes (TOT: 6, 15, 28, 36, 45), only those changes on the line of 
descent to the 40K clone (LOD: 4, 12, 22, 36, 43), and only the SNP mutations in 
each genome (SNP: 3, 9, 16, 22, 29). The LOD subset represents the estimated 
rate at which mutations fixed in the population; it excludes the effects of 
sampling within-population variation, which are unknown and may fluctuate 
over time given the effect of selective sweeps in purging variation. The SNP subset
represents those mutations that are most commonly modelled in theoretical
studies of evolutionary biology. We tested the hypothesis that the rate of mutation
accumulation was linear in two ways. 

Cumulative distribution test. We used a randomization test to determine 
whether there was evidence that the number of mutations observed in the 2K, 
5K, 10K, 15K and 20K genomes was inconsistent with an underlying probability 
distribution giving a linear increase over time. For each data set, 10 million 
simulated data sets were generated by randomly redistributing 20K mutations 
with a uniform probability at times between 0 and 20,000 generations. The con- 
fidence intervals shown in Fig. 1 and also P-values for deviations from linearity at a 
given generation were determined directly from these simulated distributions of 
mutation number over time. This test is similar in design to the Kolmogorov–Smirnov
test, but it takes into account the uneven spacing of the observations of
mutation number with respect to time and enforces stricter confidence limits on 
observations near the ends of the cumulative distribution. If any point from the 
observed genomic data set fell outside of the 95% confidence interval then the null 
hypothesis of linearity would be rejected. The most extreme deviations are not 
significant even by a one-tailed test for observing more mutations than would be 
expected from the uniform distribution for any of the three data sets: TOT (10K,
P = 0.052), LOD (15K, P = 0.092), SNP (5K, P = 0.227).
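
The randomization behind this test can be sketched as follows. This is a minimal illustration and not the implementation used for the paper: it runs far fewer replicates than the 10 million described, reports only upper-tail P-values (confidence intervals would be read from the same simulated counts), and treats ties as at least as extreme, which is an assumption. The resulting values therefore need not match the published P-values exactly. All function names are hypothetical.

```python
import bisect
import random

SAMPLE_TIMES = [2_000, 5_000, 10_000, 15_000, 20_000]


def simulate_cumulative_counts(n_mutations, rng, sample_times=SAMPLE_TIMES):
    """Place n_mutations uniformly in time and return cumulative counts at each sample time."""
    final_time = sample_times[-1]
    counts = [0] * len(sample_times)
    for _ in range(n_mutations):
        t = rng.uniform(0.0, final_time)
        first = bisect.bisect_left(sample_times, t)  # first sampled genome that carries it
        for i in range(first, len(sample_times)):
            counts[i] += 1
    return counts


def upper_tail_p(observed, n_replicates=10_000, seed=1, sample_times=SAMPLE_TIMES):
    """Per sample time, the fraction of simulations with a cumulative count >= the observed one."""
    rng = random.Random(seed)
    n_total = observed[-1]  # all mutations seen in the final genome are redistributed
    hits = [0] * len(sample_times)
    for _ in range(n_replicates):
        simulated = simulate_cumulative_counts(n_total, rng, sample_times)
        for i, (sim, obs) in enumerate(zip(simulated, observed)):
            if sim >= obs:
                hits[i] += 1
    return [h / n_replicates for h in hits]


TOT = [6, 15, 28, 36, 45]  # cumulative SNP + DIP mutations from the text
p_values = upper_tail_p(TOT)
for t, p in zip(SAMPLE_TIMES, p_values):
    print(f"{t:>6} generations: one-tailed P = {p:.3f}")
```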

Time-weighted test statistic. We tested the apparent skew towards excess muta- 
tions in early generations using a time-weighted test statistic equal to the sum- 
mation of the number of new mutations observed in each subsequent genome 
multiplied by the generation at which that genome was sampled. For example, 
the value of this test statistic for the TOT data set is 487,000 (equal to 
6 × 2,000 + 9 × 5,000 + 13 × 10,000 + 8 × 15,000 + 9 × 20,000). We determined
the significance of the observed value of the test statistic for each data set using a 
randomization test implemented with the STATISTICS101 resampling program 
(http://www.statistics101.net/). The mutations at 20K were redistributed 
randomly with a uniform probability distribution with respect to time to generate 
randomized data sets consisting of counts of mutations occurring by 2,000, 5,000, 
10,000, 15,000 and 20,000 generations. Then, one-tailed P-values were assigned to 
each data set by comparing the observed value of the test statistic to the
distribution of simulated test-statistic values obtained from performing the
randomization procedure 10 million times. The skew of excess mutations towards early
generations, as measured by this test statistic, is not significant for any of the three
data sets: TOT (P = 0.066), LOD (P = 0.256), or SNP (P = 0.299).
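
A compact sketch of the time-weighted statistic and its randomization test is given below. It is a Python stand-in written for illustration, not the STATISTICS101 program used for the paper; it uses fewer replicates, and the function names are hypothetical. Because early mutations carry small weights, an excess of early mutations lowers the statistic, so the one-tailed P-value is taken here as the fraction of simulated values that are no larger than the observed one.

```python
import random

SAMPLE_TIMES = [2_000, 5_000, 10_000, 15_000, 20_000]


def time_weighted_statistic(cumulative_counts, sample_times=SAMPLE_TIMES):
    """Sum over genomes of (new mutations since the previous genome) x (sampling generation)."""
    total, previous = 0, 0
    for count, t in zip(cumulative_counts, sample_times):
        total += (count - previous) * t
        previous = count
    return total


def early_skew_p(cumulative_counts, n_replicates=10_000, seed=1, sample_times=SAMPLE_TIMES):
    """One-tailed P: fraction of randomized data sets whose statistic is <= the observed value."""
    rng = random.Random(seed)
    observed = time_weighted_statistic(cumulative_counts, sample_times)
    n_total = cumulative_counts[-1]
    # A uniformly timed mutation first appears in a genome with probability
    # proportional to the length of the interval that ends at that genome.
    widths = [t - prev for t, prev in zip(sample_times, [0] + sample_times[:-1])]
    as_small = 0
    for _ in range(n_replicates):
        simulated = sum(rng.choices(sample_times, weights=widths, k=n_total))
        if simulated <= observed:
            as_small += 1
    return as_small / n_replicates


DATA_SETS = {
    "TOT": [6, 15, 28, 36, 45],
    "LOD": [4, 12, 22, 36, 43],
    "SNP": [3, 9, 16, 22, 29],
}
for name, counts in DATA_SETS.items():
    print(name, time_weighted_statistic(counts), round(early_skew_p(counts), 3))
```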

Lack of early synonymous substitutions. In the 2K, 5K, 10K, 15K and 20K clone 
genomes, 26 different point mutations within protein reading frames were 
observed, and all of these altered the encoded amino acid. Using the total codon 
frequencies in the ancestral genome sequence tabulated using a custom Perl 
script, we calculated the chances that each of the 12 possible nucleotide substitu- 
tions would result in a non-synonymous substitution if it occurred at a random 
coding position (A→C: 0.893; A→G: 0.758; A→T: 0.898; C→A: 0.762; C→G:
0.808; C→T: 0.559; G→A: 0.698; G→C: 0.785; G→T: 0.785; T→A: 0.797; T→C:
0.600; T→G: 0.840). From these probabilities, it is straightforward to calculate
the 0.07% probability of observing zero synonymous mutations, by chance
alone, given the observed distribution of the 26 coding base substitutions
(A→C: 2; A→G: 2; A→T: 3; C→A: 2; C→T: 3; G→A: 7; G→C: 1; G→T: 1;
T→A: 1; T→G: 4). The observed excess of non-synonymous substitutions is
highly significant. 
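
The calculation amounts to multiplying, over all 26 observed substitutions, the chance that a substitution of that type is non-synonymous. A short sketch, assuming the substitutions are independent (the "X>Y" keys and the function name are notation introduced here for illustration):

```python
import math

# Chance that each substitution type is non-synonymous at a random coding position (from the text).
P_NONSYNONYMOUS = {
    "A>C": 0.893, "A>G": 0.758, "A>T": 0.898, "C>A": 0.762, "C>G": 0.808, "C>T": 0.559,
    "G>A": 0.698, "G>C": 0.785, "G>T": 0.785, "T>A": 0.797, "T>C": 0.600, "T>G": 0.840,
}
# Observed coding substitutions by type (from the text); they sum to 26.
OBSERVED = {
    "A>C": 2, "A>G": 2, "A>T": 3, "C>A": 2, "C>T": 3,
    "G>A": 7, "G>C": 1, "G>T": 1, "T>A": 1, "T>G": 4,
}


def prob_all_nonsynonymous(p_nonsyn, counts):
    """Probability that every observed substitution is non-synonymous, assuming independence."""
    return math.prod(p_nonsyn[kind] ** n for kind, n in counts.items())


probability = prob_all_nonsynonymous(P_NONSYNONYMOUS, OBSERVED)
print(f"P(zero synonymous among {sum(OBSERVED.values())}) = {probability:.2%}")
```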

Synonymous substitution rates. Reading frames annotated in the ancestral 
genome sequence were extracted and analysed to calculate the probability that 
a random mutation would be synonymous using a custom Perl script. These 
calculations assume that the relative rates of different base changes are equal, 
except as indicated otherwise for the mutT mutator. 
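
A minimal sketch of this kind of calculation is shown below in Python rather than Perl. It assumes the standard codon assignments and equal rates for all base changes, and it does not model the mutT mutator spectrum; the function name and the example sequence are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks stop codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def synonymous_fraction(reading_frames):
    """Fraction of all single-base substitutions that leave the encoded amino acid unchanged.

    reading_frames: iterable of upper-case ACGT strings, each starting in frame.
    """
    synonymous = total = 0
    for seq in reading_frames:
        for start in range(0, len(seq) - len(seq) % 3, 3):
            codon = seq[start:start + 3]
            for pos in range(3):
                for base in BASES:
                    if base == codon[pos]:
                        continue
                    mutant = codon[:pos] + base + codon[pos + 1:]
                    total += 1
                    synonymous += CODON_TABLE[mutant] == CODON_TABLE[codon]
    return synonymous / total


print(synonymous_fraction(["ATGGCTCTG"]))  # toy reading frame, not from the ancestral genome
```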
The role of DNA shape in protein-DNA recognition


Remo Rohs¹*, Sean M. West¹*, Alona Sosinsky¹†, Peng Liu¹, Richard S. Mann² & Barry Honig¹


The recognition of specific DNA sequences by proteins is thought to depend on two types of mechanism: one that involves
the formation of hydrogen bonds with specific bases, primarily in the major groove, and one involving sequence-dependent
deformations of the DNA helix. By comprehensively analysing the three-dimensional structures of protein-DNA complexes,
here we show that the binding of arginine residues to narrow minor grooves is a widely used mode for protein-DNA
recognition. This readout mechanism exploits the phenomenon that narrow minor grooves strongly enhance the negative
electrostatic potential of the DNA. The nucleosome core particle offers a prominent example of this effect. Minor-groove
narrowing is often associated with the presence of A-tracts, AT-rich sequences that exclude the flexible TpA step. These
findings indicate that the ability to detect local variations in DNA shape and electrostatic potential is a general mechanism
that enables proteins to use information in the minor groove, which otherwise offers few opportunities for the formation of
base-specific hydrogen bonds, to achieve DNA-binding specificity.


The ability of proteins to recognize specific DNA sequences is a 
hallmark of biological regulatory processes. The determination of 
the three-dimensional structures of numerous protein-DNA com- 
plexes has provided a detailed picture of binding, revealing a struc- 
turally diverse set of protein families that exploit a wide repertoire of 
interactions to recognize the double-helix'. Nucleotide sequence- 
specific interactions often involve the formation of hydrogen bonds 
between amino-acid side chains and hydrogen-bond donors and 
acceptors of individual base pairs. It has long been recognized that 
every base pair has a unique hydrogen-bonding signature in the 
major groove, but that this is not the case in the minor groove’. 
Thus, the expectation has been that the recognition of specific 
DNA sequences would take place primarily in the major groove by 
the formation of a series of amino-acid- and base-specific hydrogen 
bonds’. This ‘direct readout’ mechanism is consistent with observa- 
tions derived from three-dimensional structures of protein-DNA 
complexes, but it is far from the entire story. 

In many complexes, the DNA assumes conformations that deviate 
from the structure of an ideal B-form double helix*°, sometimes 
bending in such a way to optimize the protein-DNA interface°, and 
in some cases undergoing large conformational changes as in the 
opening of the minor groove in the complex formed between TBP 
and the TATA box’*. The term ‘indirect readout’ was coined” to 
describe such recognition mechanisms that depend on the propensity 
of a given sequence to assume a conformation that facilitates its bind- 
ing to a particular protein. The bases involved in such mechanisms 
need not be in contact with the protein and, for example, can be found 
in linker sequences that connect two half-sites that are themselves 
bound by individual protein subunits'®"’. 

We recently described an example of a new readout mechanism, the
recognition of local sequence-dependent minor-groove shape’ that is
distinct from previously described indirect readout mechanisms. In
this case, the sequence-dependence of minor-groove width and
corresponding variations in electrostatic potential are used by the
Drosophila Hox protein Sex combs reduced (SCR) to distinguish small
differences in nucleotide sequence’’. Here we report that this mech- 
anism is a widely used mode of protein-DNA recognition that 
involves the creation of specific binding sites for positively charged 
amino acids, primarily arginine, within the minor groove. Minor- 
groove narrowing is found to be correlated with A-tracts'*", usually 
defined as stretches of four or more As or Ts that do not contain the 
flexible TpA step’, but extended here to include as few as three base 
pairs (see later). Our results offer fundamentally new insights into the 
structural and energetic origins of protein-DNA binding specificity, 
and thus have important implications for the prediction of transcrip- 
tion-factor-binding sites in genomes. 


Arginine is enriched in narrow minor grooves 

The percentage of minor-groove contacts associated with each amino
acid, classified according to the width of the minor groove, was
determined (Fig. 1a). Arginine constitutes 28% of all amino-acid residues
that contact the minor groove and is notably enriched in narrow
minor grooves, defined here by a groove width of <5.0 Å (compared
to 5.8 Å in ideal B-DNA). Remarkably, 60% of the residues in narrow
minor grooves are arginines, compared to 22% in minor grooves that
are defined as not narrow, that is, width ≥5.0 Å. A smaller enrichment
is also observed for lysine, but the overall population of lysines
within the minor groove is much smaller than that of arginines.
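
The bookkeeping behind these percentages can be sketched as a tally of contacting residues split at the 5.0 Å cutoff. The contact list below is invented purely for illustration, and the function name is hypothetical; only the cutoff is taken from the text.

```python
from collections import Counter

NARROW_CUTOFF = 5.0  # minor-groove width in Å below which a groove is called narrow


def contact_percentages(contacts, cutoff=NARROW_CUTOFF):
    """contacts: iterable of (residue_name, minor_groove_width_in_angstrom) pairs.

    Returns, for all grooves, narrow grooves and not narrow grooves, the
    percentage of contacts contributed by each residue type.
    """
    tallies = {"all": Counter(), "narrow": Counter(), "not_narrow": Counter()}
    for residue, width in contacts:
        tallies["all"][residue] += 1
        tallies["narrow" if width < cutoff else "not_narrow"][residue] += 1
    return {
        group: {res: 100.0 * n / sum(counts.values()) for res, n in counts.items()}
        for group, counts in tallies.items()
        if counts
    }


# Hypothetical contacts, not data from the paper.
toy_contacts = [("ARG", 4.2), ("ARG", 4.8), ("LYS", 4.5), ("ARG", 6.0), ("SER", 5.5), ("ASN", 7.0)]
for group, shares in contact_percentages(toy_contacts).items():
    print(group, {res: round(pct, 1) for res, pct in shares.items()})
```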

Binding to the minor groove is a characteristic of many, but not all, 
protein superfamilies and a large subset of these contact a narrow 
minor groove (Table 1). Moreover, if the minor groove is contacted, 
arginines are likely to be involved, and the likelihood that an arginine 
will be present becomes even greater for narrow minor grooves 
(Supplementary Table 1). 

We compiled the DNA sequence preferences for protein-DNA 
complexes in which an arginine contacts a narrow minor groove 
(Fig. 1b). The figure shows that the base pair that has the shortest 
contact distance with the arginine guanidinium group has a 78% 


¹Howard Hughes Medical Institute, Center for Computational Biology and Bioinformatics, Department of Biochemistry and Molecular Biophysics, Columbia University, 1130 Saint
Nicholas Avenue, New York, New York 10032, USA. ²Department of Biochemistry and Molecular Biophysics, Columbia University, 701 West 168th Street, HHSC 1104, New York, New
York 10032, USA. †Present address: Institute of Structural and Molecular Biology, School of Crystallography, Birkbeck College, Malet Street, London WC1E 7HX, UK.


*These authors contributed equally to this work. 




©2009 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 461|29 October 2009 


[Figure 1 graphic. Panel a, y axis: Contacts (%); legend: Minor groove contacts; Not narrow minor groove (≥5.0 Å); Narrow minor groove (<5.0 Å).]


Figure 1 | Amino acid frequencies in minor grooves. a, Histograms for each
amino acid illustrate the frequency with which they are observed in any
minor groove (green), in minor grooves with a width of ≥5.0 Å (blue), and in
narrow minor grooves of width <5.0 Å (red). b, Frequency of AT (W) and
GC (S) base pairs in sequences of 229 sites contacted by arginines in narrow 
minor grooves. The central base pair (boxed) is contacted by arginine. 
Frequencies are symmetrized by using both complementary strands. 


probability of being an AT and 22% of being a GC. Neighbouring 
base pairs in both the 5’ and 3’ directions surrounding the closest 
contacting base pair also have a strong tendency to be AT. Taken 
together, these data demonstrate that arginines tend to bind narrow 
minor grooves in AT-rich DNA. 


AT-rich sequences tend to narrow minor grooves 

We calculated minor-groove widths for all tetranucleotides contained 
in Protein Data Bank (PDB) structures for both free DNA (Fig. 2a) 
and DNA in complexes with proteins (Fig. 2b). There is a large spread 
of values due in part to end effects and to the effects of crystal packing, 
but some trends are evident nevertheless. For example, for free DNA 
structures most of the tetranucleotides with narrow minor grooves 
(width <5.0 Å) are AT-rich (Fig. 2a and Supplementary Table 2a).


Table 1 | Protein superfamilies with minor-groove contacts


Narrow minor groove


SRF-like
IHF-like DNA-binding proteins


Not narrow minor groove


DNA repair protein MutS, domain I
Origin of replication-binding domain, 
RBD-like 

DNA/RNA polymerases 

Eukaryotic DNA topoisomerase I, 
amino-terminal DNA-binding fragment 
Ribonuclease H-like 

TATA-box binding protein-like 


Histone-fold 
DNA breaking-rejoining enzymes 


Zn2/Cys6 DNA-binding domain 
Homeodomain-like 

p53-like transcription factors 
Lambda repressor-like DNA-binding 
domains 

Winged helix DNA-binding domain 
Leucine zipper domain 

C-terminal effector domain of the 
bipartite response regulators 
Restriction endonuclease-like 
Glucocorticoid receptor-like (DNA- 
binding domain) 


Listed are SCOP superfamilies*® that have an arginine minor-groove contact within a distance of
<6.0 Å from the base. Superfamilies that use arginine to contact a narrow minor groove
(<5.0 Å) and those that use arginine to contact a not narrow minor groove (≥5.0 Å) are shown.
Only superfamilies with a minimum of ten protein chains in PDB structures bound to DNA at
least one helical turn long are included. The percentages of chains with minor-groove contacts
vary considerably among SCOP superfamilies and are provided in Supplementary Table 1.

Similar behaviour is observed in protein-DNA complexes (Fig. 2b and
Supplementary Table 2b). In contrast, tetranucleotides with wide 
minor grooves have a strong tendency to be GC-rich. 
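
The tetranucleotide averaging can be sketched as follows. The sketch assumes that a tetranucleotide and its reverse complement are pooled under one key, an assumption that is at least consistent with the 136 unique tetranucleotides mentioned in the legend of Fig. 2, since 136 is the number of distinct tetranucleotides under that pooling. The widths in the usage example are invented, and the function names are hypothetical.

```python
from collections import defaultdict
from itertools import product

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(tetra):
    """Key shared by a tetranucleotide and its reverse complement (the alphabetically smaller one)."""
    reverse_complement = tetra.translate(COMPLEMENT)[::-1]
    return min(tetra, reverse_complement)


def mean_width_by_tetranucleotide(observations):
    """observations: iterable of (tetranucleotide, minor_groove_width_in_angstrom) pairs."""
    widths = defaultdict(list)
    for tetra, width in observations:
        widths[canonical(tetra)].append(width)
    return {key: sum(values) / len(values) for key, values in widths.items()}


# Number of distinct tetranucleotides when both strands are pooled.
n_unique = len({canonical("".join(t)) for t in product("ACGT", repeat=4)})
print(n_unique)

# Hypothetical widths, not values from the paper.
toy = [("AATT", 3.8), ("AATT", 4.2), ("TTTT", 4.0), ("AAAA", 5.0)]
print(mean_width_by_tetranucleotide(toy))
```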

The correlation between AT content and groove width is not un- 
expected given the fact that A-tracts are known to produce narrow 
minor grooves. However, TpA steps have a tendency to widen the 
minor groove’’, so it was of interest to determine whether the distinct 
properties of A-tracts and TpA steps are reflected in our tetranucleo- 
tide data set. We find that 67% of tetranucleotides composed only of 
AT base pairs have a narrow minor groove, but that this number 
increases to 82% if we exclude TpA steps so as to consider only 
A-tracts. Even A-tracts of length three have a strong tendency to 
narrow the minor groove. Forty-three per cent of the tetranucleotides 
with a minor-groove width of <5.0 Å have an A-tract length of three,
a percentage that decreases to 11% of tetranucleotides with canonical
minor-groove widths (between 5.0 and 7.0 Å) and to 4% of
tetranucleotides with minor grooves wider than 7.0 Å (Supplementary Fig. 1).
Furthermore, compared to other AT-rich sequences, A-tracts are 
specifically enriched in DNAs with narrow minor grooves (Supplemen- 
tary Fig. 1). Thus, although A-tracts are usually thought of as requiring 
four or more base pairs, in part because a minimum of four is required to 
rigidify the DNA”, this analysis shows that A-tracts as short as length 
three are positively correlated with narrow minor grooves. 
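
The A-tract definition used here (a run of A and T bases with no TpA step, of length three or more) lends itself to a simple pattern search. The sketch below is one possible implementation and not the code used for the analysis; it relies on the observation that a run of As and Ts is free of TpA steps exactly when all of its As precede its Ts. The function name is hypothetical.

```python
import re

# Maximal TpA-free runs of A/T bases: some As followed by any Ts, or Ts alone.
A_TRACT = re.compile(r"A+T*|T+")


def find_a_tracts(sequence, min_length=3):
    """Return (start, end, tract) for each A-tract of at least min_length.

    Positions are 0-based and end is exclusive.
    """
    return [
        (m.start(), m.end(), m.group())
        for m in A_TRACT.finditer(sequence.upper())
        if len(m.group()) >= min_length
    ]


print(find_a_tracts("GCAATTGC"))   # AATT contains no TpA step
print(find_a_tracts("CTTTAAAG"))   # the TpA step splits the run into two tracts
print(find_a_tracts("GTATAC"))     # TATA is interrupted by TpA steps
```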


Arginines recognize enhanced electrostatic potentials 


The minor-groove width and electrostatic potential versus binding-site
sequence for several complexes whose binding interface includes
an arginine inserted into the minor groove are plotted in Fig. 3 and
Supplementary Fig. 2. The correlation of width and potential, as well
as the tendency of arginines to be located close to minima in width
and potential, is evident. In this section we highlight a few specific
examples of how arginine–minor-groove interactions are used in
DNA recognition. 

Figure 3a represents the ternary complex of the Drosophila Hox 
protein Ultrabithorax (UBX) and its cofactor Extradenticle (EXD) 
bound to DNA”*. In this complex, Arg 5 of UBX, which is a conserved 
residue across all homeodomains, inserts into a narrow region 
formed by a 4-base-pair (bp) A-tract. An example of a long and very 
narrow A-tract that binds α2-Arg 7 from the MATa1–MATα2 complex
with DNA is shown (Fig. 3b)¹⁷. In contrast, α2-Arg 4 inserts into
a shallower region at one end of the A-tract, where there are local
minima in width and potential that are smaller than at the Arg 7 site
in the centre of the A-tract. The two POU domains of the mammalian
OCT1 (also known as POU2F1)–PORE complex bind to two A-tracts
(Fig. 3c) in which the minima are positioned in such a way as to 
provide binding sites for four arginines, two from each POU 
domain”. 

The location of these A-tracts with respect to other nucleotide 
sequence features can be used to generate specificity, as previously 
discussed for the Hox protein SCR”. In the case of SCR binding, the 
position of a TpA step within an AT-rich region has a critical role in 
binding specificity. A similar strategy is used by the motility gene 
repressor (MogR) in which two long A-tracts separated by a TpA step 
produce two arginine-binding sites’? (Fig. 3d). The unique shape 
recognized by these two arginines probably contributes to the posi- 
tion of the MogR-binding site along the DNA sequence. The overall 
tendency of TpA steps to widen the minor groove is most apparent 
when they are positioned between two A-tracts (as in SCR’ and 
MogR") where the TpA step acts as a ‘hinge’ between more rigid 
elements'>”°. In other contexts, owing to their flexibility, TpA steps 
can also be accommodated in narrow minor grooves”'. An example is 
provided by the bipartite DNA-binding domain of Tc3 transposase in 
which the arginines bind to a narrow region containing a TATA box” 
that displays enhanced negative electrostatic potential (Fig. 3e). 

Although less frequent, arginines also bind narrow grooves asso- 
ciated with non-A-tract sequences. Figure 3f summarizes features of 
the binding of the 434 repressor to its operator’’ that contains 7 bp

Figure 2 | Distribution of tetranucleotide sequences according to average
minor-groove width. Tetranucleotides from structures with a minimum 
length of one helical turn for which minor-groove width can be defined are 
ordered by average minor-groove width (red). The widths of all 


that are all AT with the exception of a central CG. (The guanine 
amino group tends to widen narrow grooves, but a single GC base 
pair can be accommodated with only little disruption). 


Arginine minor-groove interactions in the nucleosome 


Figure 4a plots minor-groove width and electrostatic potential along 
the DNA sequence of the nucleosome core particle containing recom- 
binant histones and a 147-bp DNA fragment (PDB code 1kx5)™. 
There are 14 minima in minor-groove width corresponding to regions 
where the DNA bends so as to wrap around the histone core. As earlier, 
there is a marked correlation between width and potential. The
variation in width between the narrowest and widest regions is about 5 Å,
and the difference between the maxima and minima in electrostatic
potential is about 6 kT e⁻¹ (Fig. 4a). As a consequence, there should be
a strong driving force for basic amino acids to bind to narrow regions
and indeed arginines are found in 9 of the 14 minima. These arginines
are shown in Fig. 4b where the nucleosomal DNA has been colour-coded
by minor-groove width. (Although all 14 narrow minor-groove
regions are contacted by arginines™ only 9 satisfy our criteria of
<6.0 Å between arginine atoms and base atoms in the groove). A
similar repeating pattern of narrow minor grooves that are contacted 
by arginines is seen in all 35 available nucleosome crystal structures 
(Supplementary Fig. 3a, b). 

Because short A-tracts narrow the minor groove and facilitate the 
bending of DNA, we would expect to see a periodicity of A-tracts in 
DNA sequences bound by nucleosomes in vivo. Previous analyses have 
focused on dinucleotide statistics*”®, although it has been known for 
some time that there is a periodic pattern of AAA and AAT trinucleo- 
tides in nucleosome core DNA’””’. An analysis of DNA sequences 
bound in vivo by yeast nucleosomes” reveals a clear periodicity for 
A-tracts of at least length three (denoted 3+, Fig. 4c). Moreover, 
nucleosomal DNAs contain, on average, 10.0 A-tracts of length 3+ 
(Fig. 4d). Periodicity is also detected for A-tracts of length 4+ and 
even 5+, although the number per nucleosome decreases to 4.1 and 
1.6, respectively (Supplementary Fig. 3). Thus, even though long 
A-tracts tend to be excluded from the nucleosome’, A-tracts of <5 
bp, when present, are used to facilitate bending of the DNA around the 
histone core.

tetranucleotides are shown (black) and the sequence, average width, and
occurrence in our data set are given in Supplementary Table 2. a, The 59 
unique tetranucleotides from free DNA structures. b, The set of all 136 
unique tetranucleotides derived from protein-DNA complexes. 


To evaluate the effect of TpA steps, we compared the periodicities 
of A-tracts of length three to those of other trinucleotides composed 
only of AT base pairs. Trinucleotides that contain TpA steps have a 
much weaker periodic signal than A-tracts of length three, which 
exclude the TpA step (Supplementary Fig. 4). Together, this analysis 
suggests that many of the sequence periodicities in nucleosomal DNA 
reflect the presence of short A-tracts that lead to narrow regions in the 
minor groove, which are in turn recognized by a complementary set 
of arginines present on the surface of the nucleosome core particle. 


Effects of groove width on electrostatic potential 


The remarkable correlation between minor-groove width and electro- 
static potential (Figs 3 and 4) is primarily due to the properties of the 
Poisson–Boltzmann equation that have been extensively discussed in
the literature*’. Biological macromolecules are less polarizable than
the aqueous solvent and, in the language of classical physics, can be
thought of as a low dielectric region embedded in a high dielectric
solvent. Solutions of the Poisson–Boltzmann equation for DNA
showed that contours of electrostatic potential owing to backbone 
phosphates follow the shape of the DNA and that the potentials are 
the most negative within the grooves*’. This effect is due to electro- 
static focusing, first observed for the protein superoxide dismutase”’, 
where the narrow active site focuses electric field lines away from the 
protein and into the high dielectric solvent. The same physical phe- 
nomenon produces enhanced potentials in grooves, accounting for 
the strong correlation described earlier. 

To establish the source of the effect in quantitative terms, we 
calculated the potentials for the MogR-binding site’? when the dielec- 
tric constant is set to 80 both inside the macromolecule and in the 
solvent (Fig. 5, dashed line) and for the case where the two dielectric 
constants are different (Fig. 5, solid line). Notably, a large enhance- 
ment of electrostatic potentials is only observed when the dielectric 
constant of the macromolecule and solvent are different, reflecting 
the focusing of electric field lines described qualitatively earlier. The 
small effect seen when the dielectric constant is the same results from 
the phosphates being closer to the centre of the groove when it is 
narrow (see Supplementary Fig. 5 for a breakdown of the
contributions to the net electrostatic potential). Both sets of calculations were

Figure 3 | Specific examples of minor-groove shape recognition by
arginines. a–f, DNA shapes of the binding sites of UBX–EXD (PDB code
1b8i)'* (a), MATa1–MATα2 (PDB code 1akh)” (b), and OCT1–PORE (PDB
code 1hf0)’* (c), the MogR repressor (PDB code 3fdq)”’ (d), the Tc3
transposase (PDB code 1u78)” (e) and the phage 434 repressor (PDB code
2or1)”* (f) are shown in GRASP surface representations*’*’, with convex


carried out at physiological salt concentrations. Although ionic 
strength influences the absolute values of the potentials, the dielectric 
boundary effect remains essentially the same (Supplementary Fig. 6). 


Why are arginines preferred over lysines? 


It is surprising that there is a substantial population of arginines in 
the minor groove and a large enrichment when the groove is narrow, 
whereas the effects for lysines are more modest (Fig. 1a). Arginines
have been known for some time to be enriched relative to lysines in
protein–protein*’ and protein-DNA” interfaces, and the difference
has generally been attributed to the ability of the guanidinium group 
to engage in more hydrogen bonds than the amino group of lysine”. 
To evaluate this idea we determined the number of hydrogen bonds 
formed by all the arginines and lysines in our data set that penetrate 
the minor groove. Surprisingly, on average, less than one hydrogen

surfaces colour-coded in green and concave surfaces in grey/black. Plots of
minor-groove width (blue) and electrostatic potential in the centre of the 
minor groove (red) are shown below. Arginine contacts (defined by the 
closest distance between the guanidinium groups and the bases) are 
indicated. A-tract sequences are highlighted by a solid red line, the TATA 
box in e by a dashed line. 


bond is formed by either amino-acid side chain to DNA (0.9 for 
arginine and 0.6 for lysine), and the standard deviations are such that 
this difference is insignificant (Supplementary Table 3). 

An alternative explanation derives from the difference in the size of 
the cationic moieties of the two residues. According to the classical 
Born model, the solvation free energies of ions are proportional to the 
inverse of their radii*', suggesting that it is energetically less costly to 
remove a charged guanidinium group from water than it is to remove 
the smaller amino group of a lysine. To test this quantitatively, we 
calculated the change in free energy in transferring arginine and 
lysine from water to a medium of dielectric constant 2 (see 
Methods for details). The difference in the transfer free energies 
between the two residues ranges from 2.3 to 6.6 kcal mol⁻¹,
depending on the force field that was used, with lysine consistently having
the higher value (Supplementary Table 4). These results indicate that

Figure 4 | Minor-groove shape recognition in the nucleosome.

a, Correlation of minor-groove width of the nucleosome core particle (PDB 
code 1kx5)** (blue) and electrostatic potential (red). Arginine contacts 
(defined by the closest distance between the guanidinium groups and the 
bases) are indicated. A-tract sequences are highlighted by solid red lines. 
b, Schematic representation of the DNA backbone in the nucleosome 


the higher prevalence of arginines compared to lysines in minor 
grooves is due, at least in part, to the greater energetic cost of remov- 
ing a charged lysine from water than removing a charged arginine. 
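
The size argument can be illustrated with the Born expression for a spherical ion, in which the cost of moving a charge between two dielectric media scales with the inverse of the ionic radius. The sketch below is a back-of-the-envelope illustration only: the two radii are hypothetical placeholders, not parameters from any of the force fields used in the paper, and the numbers it prints are not expected to reproduce Supplementary Table 4.

```python
COULOMB_KCAL_ANGSTROM = 332.06  # e^2 / (4 * pi * eps0), in kcal Å mol^-1 per unit charge squared


def born_transfer_energy(radius, eps_from=80.0, eps_to=2.0, charge=1.0):
    """Born free energy (kcal/mol) of moving a spherical ion between two dielectric media.

    radius is in Å; a positive value means the transfer is unfavourable.
    """
    return COULOMB_KCAL_ANGSTROM * charge ** 2 / (2.0 * radius) * (1.0 / eps_to - 1.0 / eps_from)


# ASSUMPTION: illustrative effective radii (Å) for the two cationic groups.
hypothetical_radii = {"lysine amino group": 2.0, "arginine guanidinium group": 2.5}
for group, radius in hypothetical_radii.items():
    print(f"{group}: {born_transfer_energy(radius):.1f} kcal/mol")
```

With any choice of radii in which the guanidinium group is the larger ion, the model assigns lysine the higher transfer cost, which is the qualitative point made in the text.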


Concluding remarks 


We have shown that there is a marked enrichment of arginines in 
narrow regions of the DNA minor groove that provides the basis for 
a new DNA recognition mechanism that is used by many families of 
DNA-binding proteins. A readout mechanism on the basis of groove 
width requires a connection between sequence and shape. This con- 
nection seems to be provided in part by A-tracts, which have a strong 
tendency to narrow the groove, producing binding sites for arginines 
that, when spaced appropriately on the protein surface, offer a com- 
plementary set of positive charges that can recognize local variations in 
shape. Arginines often insert into the minor groove as part of short 
sequence motifs (for example, Arg-Gln-Arg in the Hox protein SCR”,

Figure 5 | The biophysical origins of the negative potential of narrow minor
grooves. Electrostatic potential in the minor groove of the MogR-binding
site (PDB code 3fdq)"’, calculated in the presence of a dielectric boundary
(ε = 2 in solute and ε = 80 in solvent, solid line) and in the absence of a
boundary (ε = 80 in both solute and solvent, dashed line).

colour-coded by minor-groove width (red ≤4.0 Å, pink >4.0 Å and <5.0 Å,
light blue >5.0 Å and <6.0 Å, dark blue >6.0 Å), including all arginines that
contact the minor groove. c, The distribution of A-tracts of length 3 bp or 
longer in 23,076 yeast nucleosome-bound DNA sequences”. d, Histogram of 
the occurrence of A-tracts of length three or longer in the same data set”’. 


Engrailed’®, Arg-Gly-His-Arg in MATa1–MATα2 (ref. 17), Arg-Arg-
Gly-Arg in the nuclear orphan receptor” and Arg-Gly-Gly-Arg in the
human orphan receptor’’), thus offering a variety of presentation 
modes that can contribute to the specificity of DNA shape recognition. 

The tendency of A-tracts to narrow the minor groove is primarily 
due to their ability to assume conformations, by propeller twisting, that 
lead to the formation of inter-base-pair hydrogen bonds in the major 
groove’. This network is disrupted by TpA steps as notably seen in the 
MogR-binding site’’. GC base pairs also have a tendency to widen the 
minor groove. The combination of these and other factors, such as the 
effects induced by flanking bases that are not directly located within the 
binding site”, can produce a complex minor-groove landscape that 
offers numerous possibilities for specific interactions with proteins. 
Indeed, minor-groove geometry is no doubt the result of the inter- 
action of intrinsic and protein-induced structural effects. 

The physical mechanisms described here are markedly evident in 
the nucleosome. The energetic cost of narrowing and bending the 
DNA in regions where the backbone faces inward will be reduced by 
the presence of short A-tracts that have an intrinsic propensity to 
assume such conformations and hence to bend the DNA”. 
Furthermore, the penetration of arginines into the minor groove at 
sites where the DNA bends and the groove is narrow’’*° provides an 
important stabilizing interaction. 

The variations in DNA shape observed in protein-DNA complexes 
often reflect conformational preferences of free DNA*'**'. Sequence- 
dependent conformational preferences have also been observed in 
computational studies!’ and, most recently, analysis of hydroxyl 
radical cleavage patterns shows that DNA shape is under evolutionary 
selection*’. Such observations indicate that the role of DNA shape 
must be taken into consideration when annotating entire genomes 
and predicting transcription-factor-binding sites. The biophysical 
insights described here, together with the increased availability of 
high-throughput binding data, offer the hope of major progress in 
understanding how proteins recognize specific DNA sequences and in 
the development of improved predictive algorithms.

METHODS SUMMARY


Minor-groove geometry was analysed with Curves” for all 1,031 crystal structures 
of protein-DNA complexes in the PDB that have any amino acid contacting base 
atoms. Protein side chains contact the minor groove in 69% of those structures 
that have at least one helical turn of DNA. The probabilities for each amino acid to 
contact the minor groove were calculated for three groups of DNAs: total, narrow 
and not narrow. Proteins were grouped on the basis of 40% sequence identity. The 
properties of free DNAs and DNAs bound to proteins were analysed on the basis 
of the minor-groove widths of tetranucleotides, defined at the central base-pair 
step. 

All 35 crystal structures of the nucleosome available in the PDB were analysed. 
The analysis of nucleosomal DNA is based on 23,076 sequences in an in vivo yeast 
data set. The signal for a sequence motif in nucleosomal DNA is positive for a 
base pair when the base pair comprises any part of the sequence motif. 
Frequencies were symmetrized by analysing both complementary DNA strands. 

Electrostatic potentials were obtained from solutions to the non-linear 
Poisson–Boltzmann equation at physiological ionic strength using the DelPhi 
program. Regions inside the molecular surface of the DNA were assigned a 
dielectric constant of 2, whereas the solvent was assigned a value of 80. The 
potential is reported at a reference point at the centre of the minor groove. 
The reference point is located close to the bottom of the groove in approximately 
the plane of a base pair. This definition provides a measure of electrostatic 
potential as a function of base sequence. Solvation free energies of amino acids 
were calculated for extended conformations of arginine and lysine side chains 
and compared for four different force fields. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 
Calculation of minor-groove width. There were in total 1,031 crystal structures 
of protein-DNA complexes in the PDB as of 1 June 2008, in which the DNA was 
contacted by any amino-acid side chain at a distance <6.0 Å from base atoms. Of 
these structures, 567 contained at least one helical turn, and no chemical 
modifications or deformations that prevent the calculation of minor-groove width. 
Groove geometry was analysed using Curves and minor-groove width was 
calculated as a function of base sequence by averaging all the Curves levels given 
for each nucleotide. 
Statistical analysis of protein-DNA contacts. Of the 567 protein-DNA struc- 
tures in our data set, 392 have at least one minor-groove contact defined by a 
distance of <6.0 Å between any base and side-chain atoms. To avoid an 
oversampling bias, proteins in this data set that shared ≥40% sequence identity were 
grouped to create 109 groups. The average number of contacts within each group 
was subsequently averaged over all 109 groups. These averages were divided by 
the sum of the average number of contacts for all amino acids to calculate the 
total minor-groove contacts, contacts in not narrow minor grooves (≥5.0 Å), 
and contacts in narrow minor grooves (<5.0 Å), for each amino acid. 
Hydrogen-bond contacts between amino-acid side chains and the DNA bases 
and phosphates, water molecules and other protein atoms were identified with 
the HBplus program. 
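The two-stage averaging described above (first within each homology group, then over all groups, then normalized over amino acids) can be sketched as follows. This is only an illustration of the arithmetic under stated assumptions: the function name, the dictionary layout and the toy counts are hypothetical and are not taken from the paper.

```python
from collections import defaultdict

def contact_fractions(contacts, groups):
    """Normalized minor-groove contact fraction per amino acid.

    contacts: {protein_id: {amino_acid: number_of_contacts}}  (hypothetical layout)
    groups:   {protein_id: homology_group_id}                 (hypothetical layout)
    """
    # Collect the per-protein contact counts of each homology group.
    by_group = defaultdict(list)
    for protein_id, counts in contacts.items():
        by_group[groups[protein_id]].append(counts)

    amino_acids = sorted({aa for counts in contacts.values() for aa in counts})

    # Average within each group, then average the group means over all groups,
    # so that heavily sampled protein families do not dominate.
    mean_over_groups = {}
    for aa in amino_acids:
        group_means = [
            sum(counts.get(aa, 0) for counts in members) / len(members)
            for members in by_group.values()
        ]
        mean_over_groups[aa] = sum(group_means) / len(group_means)

    # Divide by the sum over all amino acids to obtain fractions.
    total = sum(mean_over_groups.values())
    return {aa: value / total for aa, value in mean_over_groups.items()}
```

The same function could be applied separately to contacts in narrow and in not narrow grooves by passing in counts restricted to each class.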
Structural annotation of DNA-binding proteins. The proteins in our data set of 
protein-DNA complexes were classified in SCOP superfamilies. Proteins for 
which SCOP annotations were not available were annotated manually or using 
the ASTRAL database. 
Correlation of tetranucleotide structure and sequence. Tetranucleotides in 
free DNA and protein-DNA complexes were used to analyse the base sequence 
propensity of minor-groove regions as a function of minor-groove width. The 
minor-groove width of a tetranucleotide was defined by the average of all 
Curves levels for groove width of the second nucleotide and the first level of 
the third nucleotide, which describes groove width at the central base-pair step. 
End regions and irregular tetranucleotides were excluded by requiring groove 
width definitions for at least one Curves level of each of the four nucleotides. 
Tetranucleotides from nucleosomal DNA were excluded from this analysis 
because the DNA is strongly deformed and the spacing between narrow regions 
is fixed at about one helical turn, thus adding a bias to the results. When applied 
to the 521 protein-DNA complexes in our data set, these criteria allowed the 
analysis of all 136 possible unique tetranucleotides. When applied to the 88 free 
DNA structures in our data set, the same criteria resulted in the analysis of 59 
unique tetranucleotides. To increase coverage for the free DNA data set, NMR 
structures were included if dipolar coupling data were used in the refinement. 
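The figure of 136 unique tetranucleotides follows from treating a tetranucleotide and its reverse complement as the same double-stranded sequence: of the 256 possible tetranucleotides, 16 are self-complementary, giving (256 − 16)/2 + 16 = 136. A short enumeration confirms the count; the helper names here are illustrative only.

```python
from itertools import product

COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def reverse_complement(seq):
    """Sequence of the complementary strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def unique_tetranucleotides():
    """Set of tetranucleotides, counting a sequence and its reverse complement once."""
    all_tetramers = ("".join(p) for p in product("ACGT", repeat=4))
    # Represent each strand pair by the alphabetically smaller of the two strands.
    return {min(t, reverse_complement(t)) for t in all_tetramers}

print(len(unique_tetranucleotides()))  # 136
```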
Propensity of sequence motifs in nucleosomes. The structural analysis of 
nucleosomes includes all 35 crystal structures in the PDB as of 1 May 2009. 
The sequence analysis was based on 23,076 nucleosome sequences of length 
146-148 bp in a yeast in vivo data set. These nucleosome sites were scanned 
for sequence motifs such as A-tracts of different lengths, TpA steps, or other AT- 
rich regions. A given motif contributed to a positive signal for any base pair that 
overlapped that motif, thus longer motifs contributed signals to more nucleotide 
positions. The frequencies of all motifs were symmetrized by analysing both 
complementary strands. 
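The motif signal defined above can be sketched in a few lines. The sketch assumes one plausible reading of the symmetrization step, namely that each strand is scanned 5' to 3' in its own coordinates and the two per-position signals are averaged; the function names and the example sequence are hypothetical.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}

def reverse_complement(seq):
    """Sequence of the complementary strand, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def motif_signal(seq, motif):
    """Return a 0/1 list with 1 at every position overlapped by the motif."""
    signal = [0] * len(seq)
    start = seq.find(motif)
    while start != -1:
        for i in range(start, start + len(motif)):
            signal[i] = 1
        # Step by one so that overlapping occurrences are also found.
        start = seq.find(motif, start + 1)
    return signal

def symmetrized_signal(seq, motif):
    """Average the signals of both strands, each scanned 5' to 3' (an assumption)."""
    forward = motif_signal(seq, motif)
    reverse = motif_signal(reverse_complement(seq), motif)
    return [(f + r) / 2 for f, r in zip(forward, reverse)]
```

Because every base pair inside an occurrence is marked, a longer motif such as an A-tract of length five contributes signal at more positions than a TpA step, as the text notes.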

Calculations of electrostatic potentials. Electrostatic potentials were obtained 
from solutions to the non-linear Poisson–Boltzmann equation at 0.145 M salt 
using the DelPhi program. Partial charges and atomic radii were taken from 
the Amber force field. The interior of the molecular surface of the solute 
molecule (calculated with a 1.4 Å probe sphere) was assigned a dielectric 
constant of ε = 2, whereas the exterior aqueous phase was assigned a value of ε = 80. 
Debye–Hückel boundary conditions and five focusing steps were used with a 
cubic grid size of 165 (a grid size of 185 was used for the nucleosome). 

The electrostatic potential is reported at a reference point close to the bottom 
of the minor groove approximately in the plane of base pair i. The reference point 
i is defined as the geometric midpoint between the O4′ atoms of nucleotide i + 1 
in the 5′–3′ strand, and nucleotide i − 1 in the 3′–5′ strand. Where the DNA 
strongly bends into the major groove the reference point can clash with the 
guanine amino group and cause large positive potentials (as seen in Fig. 4a for 
three regions of the nucleosome). 

Desolvation free energies were calculated with the DelPhi program for the 
transfer of arginine and lysine side chains in extended conformations from water 
to a medium of dielectric constant ε = 2. Transfer free energies were calculated 
for each of the two side chains based on charge distributions and atomic radii 
taken from Amber and three other force fields (see Supplementary Table 4). 
A γ-ray burst at a redshift of z ~ 8.2 
Long-duration γ-ray bursts (GRBs) are thought to result from the 
explosions of certain massive stars, and some are bright enough 
that they should be observable out to redshifts of z > 20 using 
current technology. Hitherto, the highest redshift measured 
for any object was z = 6.96, for a Lyman-α emitting galaxy. 
Here we report that GRB 090423 lies at a redshift of z ~ 8.2, imply- 
ing that massive stars were being produced and dying as GRBs 
~630 Myr after the Big Bang. The burst also pinpoints the location 
of its host galaxy. 

GRB 090423 was detected by the Burst Alert Telescope (BAT) on 
NASA’s Swift satellite at 07:55:19 UT on 23 April 2009. Observations 
with Swift’s X-ray Telescope (XRT), which began 73 s after the trig- 
ger, revealed a variable X-ray counterpart and localized its position to 
a precision of 2.3 arcsec (at the 90% confidence level). Ground-based 
optical observations in the r, i and z filters starting within a few min- 
utes of the burst revealed no counterpart at these wavelengths 
(Supplementary Information). 

The United Kingdom Infrared Telescope (UKIRT), Hawaii, began 
imaging about 20 min after the burst, in response to an automated 
request, and provided the first infrared (2.15-μm) detection of the 
GRB afterglow. In parallel, observations in other near-infrared (NIR) 
filters using the Gemini North 8-m telescope, Hawaii, showed that 
the counterpart was only visible at wavelengths greater than about 
1.2 μm (Fig. 1). In this range, the afterglow was relatively bright and 
exhibited a shallow spectral slope, F, x v °°, in contrast to the deep 
limit on any flux at 1.02 μm. Later observations from Chile using the 
MPI/ESO 2.2-m telescope, Gemini South and the Very Large 
Telescope (VLT) confirmed this finding. Such a sharp spectral break 
cannot be produced by dust absorption at any redshift, and is a 
textbook case of a short-wavelength “drop-out” source. The full 
grizYJHK spectral energy distribution (SED) obtained ~17h after 
burst gives a photometric redshift of z= 8.06 *? 53, assuming a simple 
intergalactic medium (IGM) absorption model. Complete details of 
our imaging campaign are given in Supplementary Table 1. 

Our first NIR spectroscopy was performed with the European 
Southern Observatory (ESO) 8.2-m VLT, starting about 17.5 h after 
the burst. These observations revealed a flat continuum that abruptly 
disappeared at wavelengths less than about 1.13 μm, confirming the 
origin of the break as being due to Lyman-α absorption by neutral 


Figure 1| Multiband images of the afterglow of GRB 090423. The right- 
most panel shows the discovery image made using the UKIRT Wide Field 
Infrared Camera with the K filter (centred at 2.15 μm) at a mid-time of about 
30 min after the burst. The other three images (Y, 1.02 μm; J, 1.26 μm; 
H, 1.65 μm) were obtained approximately 1.5 h after the burst using Gemini 
North’s Near Infrared Imager and Spectrometer (NIRI). The main panels are 
40 arcsec to a side, oriented with north to the top and east to the left. Insets, 
regions around the GRB, smoothed and at higher contrast. The absence of 
any flux in Y implies a power-law spectral slope between Y and J steeper than 
F, « v ‘8 and, coupled with the blue colour at longer wavelengths 
(J—H(AB) ~ 0.15 mag), immediately implies a redshift greater than about 
7.8 for GRB 090423. 
hydrogen, with a redshift of z ~ 8.2. The spectrum and broadband 
photometric observations, plotted over model data, are shown in 
Fig. 2. To obtain a more quantitative estimate of the redshift, we fit 
the spectra in redshift versus $\log[N_{\mathrm{HI}}\ (\mathrm{cm}^{-2})]$ space, assuming a flat 
prior likelihood value for $\log[N_{\mathrm{HI}}\ (\mathrm{cm}^{-2})]$ of between 19 and 23, 
which is broadly consistent with the distribution observed for 
lower-redshift GRB hosts. We take the neutral fraction of the 
IGM to be 10%, although our conclusions depend only weakly on 
this assumption. We find the redshift from ISAAC spectroscopy to be 
z=8. (or An additional spectrum, recorded ~40h after the 
burst using the VLT’s Spectrograph for INtegral Field Observations 
in the Near Infrared confirms this analysis, yielding z= 8.3311 
(Supplementary Information). Fitting simultaneously to both spec- 
tra and the photometric data points gives our best estimate of the 
redshift, z = 8.23. The low signal-to-noise ratio means that we 
are unable to detect metal absorption features in either spectrum 
(which would provide a more precise value of the redshift) and 
prevents a meaningful attempt to measure the IGM H I column density 
in this instance. Our three independent redshift measures are 
consistent with that reported from a low-resolution spectrum obtained 
with the Telescopio Nazionale Galileo, La Palma. 
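As a rough cross-check of the numbers above, the position of a Lyman-α break maps onto redshift through λ_obs = (1 + z) λ_rest, with λ_rest = 1215.67 Å. The sketch below applies only this relation; it ignores the damping wing and the IGM model that enter the fit described in the text, so it should be read as an order-of-magnitude illustration rather than the analysis actually performed. The function names are illustrative.

```python
LYMAN_ALPHA_REST_UM = 0.121567  # rest wavelength of Lyman-alpha in micrometres

def redshift_from_break(observed_um):
    """Redshift implied by a Lyman-alpha break observed at the given wavelength."""
    return observed_um / LYMAN_ALPHA_REST_UM - 1.0

def break_wavelength_um(z):
    """Observed wavelength (micrometres) of the Lyman-alpha break at redshift z."""
    return LYMAN_ALPHA_REST_UM * (1.0 + z)

# A break near 1.13 micrometres implies z of roughly 8.3 in this naive estimate,
# and z = 8.2 places the break near 1.12 micrometres.
print(round(redshift_from_break(1.13), 2))
print(round(break_wavelength_um(8.2), 3))
```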

The X-ray and NIR light curves of GRB 090423 (Fig. 3) show a 
broken power-law decay, with evidence of flares in both the X-ray 
and the infrared bands. The spectral energy distribution is consistent 
with the presence of the cooling break between the X-ray and optical 
bands. Apart from the unusually shallow spectral slope of the con- 
tinuum at wavelengths greater than 1.2 um, its afterglow properties in 
general appear to be consistent with the bulk GRB population (see 
Supplementary Information for further discussion). 
With the standard cosmological parameters (Hubble parameter, 
$H_0 = 71$ km s$^{-1}$ Mpc$^{-1}$; total matter density, $\Omega_{\mathrm{M}} = 0.27$; dark- 
energy density, $\Omega_{\Lambda} = 0.73$) a redshift of z = 8.2 corresponds to a time 
of only 630 Myr after the Big Bang, when the Universe was just 4.6% 
of its current age. GRB 090423’s inferred isotropic equivalent energy, 
$E_{\mathrm{iso}} = 1 \times 10^{53}$ erg (8-1,000 keV), indicates that it was a bright, but 
not extreme, GRB. Thus, we find no evidence of exceptional behaviour 
that might indicate an origin in a population III progenitor. 
First-generation stars are thought more likely to collapse into par- 
ticularly massive black holes, which in turn may produce unusually 
long-lived GRBs; this seems not to be the case for GRB 090423. 
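The quoted age of about 630 Myr and the 4.6% fraction can be reproduced from the stated parameters with the closed-form age of a flat universe containing only matter and a cosmological constant, t(z) = [2 / (3 H_0 √Ω_Λ)] asinh[√(Ω_Λ/Ω_M) (1 + z)^(−3/2)]. The sketch below is an independent check under that assumption (radiation is neglected, which changes the result only slightly at this redshift); it is not the authors' calculation.

```python
import math

KM_PER_MPC = 3.0856775814913673e19   # kilometres in one megaparsec
SECONDS_PER_GYR = 3.15576e16         # seconds in 10^9 Julian years

def age_gyr(z, h0=71.0, omega_m=0.27, omega_lambda=0.73):
    """Age of a flat matter + Lambda universe at redshift z, in Gyr.

    Valid only when omega_m + omega_lambda = 1; radiation is neglected.
    """
    hubble_time_gyr = KM_PER_MPC / h0 / SECONDS_PER_GYR
    x = math.sqrt(omega_lambda / omega_m) * (1.0 + z) ** -1.5
    return hubble_time_gyr * 2.0 / (3.0 * math.sqrt(omega_lambda)) * math.asinh(x)

age_at_burst = age_gyr(8.2)   # about 0.63 Gyr
age_today = age_gyr(0.0)      # about 13.7 Gyr
print(round(age_at_burst * 1000), "Myr,", round(100 * age_at_burst / age_today, 1), "%")
```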

Indeed, we note that the γ-ray duration of GRB 090423, 
$T_{90} = 10.3$ s, corresponds in the rest frame to only 1.1 s, and the peak 
energy measured by BAT, 49 keV, is moderately hard in the rest 
frame. Two other GRBs with z > 5 (GRB 060927 and GRB 080913) 
had similarly short rest-frame durations, leading to some debate as 
to whether their progenitors were similar to those of the ‘short-hard’ 
class of GRBs, which are not thought to be directly related to core 
collapse. However, in the case of GRB 090423, a more careful 
extrapolation of the observed γ-ray and X-ray light curves to lower 
redshifts shows that its duration would have appeared significantly 
longer than suggested by naive time-dilation considerations. In 
any event, short GRBs probably have their origins in compact objects 
that are themselves the end products of massive stars, so the above 
conclusions will hold irrespective of the population from which 
GRB 090423 derives. 

It has long been recognized that GRBs have the potential to be power- 
ful probes of the early Universe. Their association with individual stars 
means that they serve as a signpost of star formation, even if their host 
Figure 2 | The composite infrared spectrum of the GRB 090423 afterglow. 
SZ-band (0.98–1.1 μm) and J-band (1.1–1.4 μm) one- and two-dimensional 
spectra obtained with the VLT using the Infrared Spectrometer And Array 
Camera (ISAAC). Also plotted are the sky-subtracted photometric data 
points obtained using Gemini North’s NIRI (red) and the VLT’s High Acuity 
Wide field K-band Imager and Gemini South’s Gemini Multi-Object 
Spectrograph (blue) (scaled to 16 h after the burst and expressed in 
microjanskys; 1 Jy = $10^{-26}$ W m$^{-2}$ Hz$^{-1}$). The vertical error bars show the 
2σ (95%) confidence level, and the horizontal lines indicate the widths of the 
filters. The shorter-wavelength measurements are non-detections, and 
emphasize the tight constraints on any transmitted flux below the break. The 
break itself, at an observed wavelength of about 1.13 μm, is seen to occur 
close to the short-wavelength limit of the J-band spectrum, below which, 
although noisy, the spectrum shows no evidence of any detected continuum. 
Details of the data-reduction steps and adaptive binning used to construct 
these spectra are given in Supplementary Information. A model spectrum 
showing the H I damping wing for a host galaxy with a hydrogen column 
density of $N_{\mathrm{HI}} = 10^{21}$ cm$^{-2}$ at a redshift of z = 8.23 is also plotted (solid 
black line), and provides a good fit to the data. Inset, allowing for a wider 
range in possible host $N_{\mathrm{HI}}$ values gives the 1σ (68%) and 2σ confidence 
contours shown. The fact that no deviation is seen from a power-law 
spectrum at wavelengths greater than 1.2 um, together with its shallow 
spectral slope, suggests that there is little or no dust along the line of sight 
through the GRB host galaxy (unless it is ‘grey’), consistent with the galaxy 
being relatively unevolved, and having a low abundance of metals. 
Figure 3 | The X-ray and infrared light curves of GRB 090423. The axes 
show both observed (left-hand and bottom axes) and rest-frame (right-hand 
and top axes) quantities. The X-ray light curve was obtained using Swift’s 
BAT (cyan) and XRT (magenta), where the BAT observations have been 
extrapolated into the X-ray band. The fitted function represents a 
phenomenological model of the prompt and afterglow components. The 
infrared light curve was obtained using UKIRT, Gemini North, the MPI/ESO 
2.2-m telescope and the VLT. For consistency, although individual bands are 
plotted, they have been transformed into absolute magnitudes in the J band 
by means of the best-fitting SED (F,, x y °°) We show two illustrative fits 
to the infrared light curve. The solid line shows a plateau, breaking at 


galaxies are too faint to detect directly. Equally important, precise 
determination of the hydrogen Lyman-α absorption profile can provide a 
measure of the neutral fraction of the IGM at the location of the 
burst. With multiple GRBs at redshifts of z > 7, and the associated 
information about the IGM, we could therefore trace the process of 
reionization from its early stages. 

The high redshift of GRB 090423 has several crucial implications. 
Predictions based on extrapolating the global star-formation-rate 
density suggest that the observed rate of GRBs at z ~ 8 should be about 
40% of that at z~ 6 (ref. 12). Given the extra difficulty of identifying 
afterglows at higher redshifts, our finding is broadly consistent with 
these predictions. This is extremely encouraging for the prospects of 
future initiatives aimed at finding high-redshift GRBs and using them 
to locate and study primordial galaxies and measure the history of star 
formation at early times. Furthermore, it is close to the redshift 
range in which the bulk of the cosmic reionization is thought to have 
taken place. Very high-redshift GRBs for which infrared spectroscopy 
was possible earlier, or which had brighter afterglows, would 
provide a direct probe of the progress of reionization. Finding such 
events is not an unreasonable hope: the most extreme GRBs have had 
afterglows that were intrinsically significantly brighter than that of 
GRB 090423 at the same rest-frame time, and our first spectra were 
recorded more than 15 h after the burst. Spectroscopy with a high 
signal-to-noise ratio would also provide a measure of the metallicity 
of the host galaxy, which potentially offers important clues to the 
24,000 s to a steeper slope proportional to ~t ‘*. This underestimates the 
late time points, which must then be interpreted as a flare. The dashed line 
shows an alternative model, in which mid-time points at ~60,000s are 
instead interpreted as a flare; this is more consistent with the later time 
points and the X-ray break time at the end of the plateau. However, in this 
case the post-break slope, proportional to ~t °”, is much slower than the 
X-ray decay at comparable times, and it further requires an additional break in 
the light curve to accommodate the late-time upper limit. Error bars are 1σ 
(68% confidence level) and the absolute magnitude scale corresponds to 
absolute AB magnitudes at 0.136 μm. See Supplementary Information for 
further details. 


nature of any earlier generations of stars. Because the massive stars 
that yield GRBs are also likely to belong to the same population that is 
responsible for reionization, this suggests that GRBs will ultimately be 
used to constrain both sides—supply and demand—of the cosmic 
ionization budget in the early Universe. 
GRB 090423 at a redshift of z ~ 8.1 


R. Salvaterra, M. Della Valle, S. Campana, G. Chincarini, S. Covino, P. D'Avanzo, A. Fernandez-Soto, 
C. Guidorzi, F. Mannucci, R. Margutti, C. C. Thöne, L. A. Antonelli, S. D. Barthelmy, M. De Pasquale, 
V. D'Elia, F. Fiore, D. Fugazza, L. K. Hunt, E. Maiorano, S. Marinoni, F. E. Marshall, E. Molinari, 
J. Nousek, E. Pian, J. L. Racusin, L. Stella, L. Amati, G. Andreuzzi, G. Cusumano, E. E. Fenimore, 
P. Ferrero, P. Giommi, D. Guetta, S. T. Holland, K. Hurley, G. L. Israel, J. Mao, C. B. Markwardt, 
N. Masetti, C. Pagani, E. Palazzi, D. M. Palmer, S. Piranomonte, G. Tagliaferri & V. Testa 


Gamma-ray bursts (GRBs) are produced by rare types of massive 
stellar explosion. Their rapidly fading afterglows are often bright 
enough at optical wavelengths that they are detectable at cos- 
mological distances. Hitherto, the highest known redshift for a 
GRB was z = 6.7 (ref. 1), for GRB 080913, and for a galaxy was 
z = 6.96 (ref. 2). Here we report observations of GRB 090423 and 
the near-infrared spectroscopic measurement of its redshift, 
$z = 8.1^{+0.1}_{-0.3}$. This burst happened when the Universe was only about 
4 per cent of its current age. Its properties are similar to those of 
GRBs observed at low/intermediate redshifts, suggesting that the 
mechanisms and progenitors that gave rise to this burst about 
600,000,000 years after the Big Bang are not markedly different 
from those producing GRBs about 10,000,000,000 years later. 

GRB 090423 was detected by NASA’s Swift satellite on 23 April 2009 
at 07:55:19 UT as a double-peaked burst of duration $T_{90} = 10.3 \pm 1.1$ s. 
As observed by Swift’s Burst Alert Telescope (BAT), it had a 
15–150-keV fluence of $F = (5.9 \pm 0.4) \times 10^{-7}$ erg cm$^{-2}$ and a peak energy of 
E, = 4878 keV (errors at the 90% confidence level). Its X-ray afterglow 
was identified by Swift’s X-ray Telescope (XRT), which began obser- 
vations 73s after the BAT trigger. A prominent flare was detected at 
t ~ 170 s in the X-ray light curve, which shows a typical ‘steep decay/ 
plateau/normal decay’ behaviour (Fig. 1). Swift’s Ultraviolet/Optical 
Telescope did not detect a counterpart even though it started making 
settled exposures only 77 s after the trigger. A 2-μm counterpart was 
detected with the United Kingdom Infra-Red Telescope, Hawaii, 
20 min after the trigger. Evidence that this burst occurred at high 
redshift was given by the Gamma-Ray Burst Optical/Near-Infrared 
Detector (GROND, Chile) multiband imager (from the g’ band to 
the K band), which indicated a photometric redshift of z= 8.07)3 
(ref. 7). 

We used the 3.6-m Telescopio Nazionale Galileo (TNG, La Palma) 
with the Near Infrared Camera Spectrometer (NICS) and the Amici 
prism to obtain a low-resolution (R ~ 50) spectrum of GRB 090423 
~14 h after the trigger. NICS/Amici is an ideal instrument to detect 
spectral breaks in the continuum of faint objects because of its high 
efficiency and wide simultaneous spectral coverage (0.8–2.4 μm). 


The spectrum (Fig. 2) reveals a clear break at a wavelength of 
1.1 μm (ref. 8). We derive a spectroscopic redshift for the GRB of 
$z = 8.1^{+0.1}_{-0.3}$ (ref. 9; see Supplementary Information, section 3), 
interpreting the break as Lyman-α absorption in the intergalactic medium. 
No other significant absorption features were detected. This 
result is consistent, within the errors, with the measurement reported 
in ref. 7. 

At z ~ 8.1, GRB 090423 has a prompt-emission rest-frame duration 
of only $T_{90,\mathrm{rf}} = 1.13 \pm 0.12$ s in the redshifted 15–150-keV energy 
band, an isotropic equivalent energy of $E_{\mathrm{iso}} = (1.0 \pm 0.3) \times 10^{53}$ erg 
in the redshifted 8–1,000-keV energy band and a peak energy of 
$E_{\mathrm{p,rf}} = 437 \pm 55$ keV. The short duration and the high peak energy 
are consistent both with the distribution of long bursts, linked to 
massive stellar collapse, and with the population of short bursts, 
thought to arise from the merger of binary compact stars. 
Although the analysis of the spectral lag between the high- and low- 
energy channels in the BAT band is inconclusive about the classification 
of GRB 090423, the high $E_{\mathrm{iso}}$ argues in favour of a long GRB. The 
fact that GRB 090423 matches the $E_{\mathrm{iso}}$–$E_{\mathrm{p,rf}}$ correlation of long GRBs 
within 0.5σ further supports this classification (Supplementary 
Fig. 2). 
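The rest-frame duration and peak energy quoted above follow from the observed values by cosmological time dilation and photon redshift: durations are divided by (1 + z) and photon energies are multiplied by (1 + z). A minimal sketch, assuming z = 8.1, the observed duration of 10.3 s, and an observed peak energy of about 48 keV (the value implied by 437 keV at this redshift); the function names are illustrative.

```python
def rest_frame_duration(observed_s, z):
    """Duration in the source rest frame: observed intervals are stretched by (1 + z)."""
    return observed_s / (1.0 + z)

def rest_frame_energy(observed_kev, z):
    """Photon energy in the source rest frame: observed energies are reduced by (1 + z)."""
    return observed_kev * (1.0 + z)

z = 8.1
print(round(rest_frame_duration(10.3, z), 2), "s")   # about 1.13 s
print(round(rest_frame_energy(48.0, z)), "keV")      # about 437 keV
```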

The rest-frame γ-ray and X-ray light curves of GRB 090423 are 
remarkably akin to those of long GRBs at low, intermediate and high 
redshifts (Fig. 1), suggesting similar physics and interaction with the 
circumburst medium. The near-infrared light curve of GRB 090423 
~15 h after the trigger shows a temporal decay with a power-law 
index of α ~ 0.5, which is markedly different from the decay 
observed at X-ray energies during the same time interval, which 
has a power-law index of $\alpha_{\mathrm{X,2}} \sim 1.3$ (Supplementary Fig. 3 and 
Supplementary Information, section 2). As for other lower-redshift 
GRBs, this behaviour is difficult to reconcile with standard afterglow 
models, although the sampling of the near-infrared light curve is too 
sparse for any firm conclusion to be drawn. 

The spectral energy distribution of near-infrared afterglow is well 
fitted by a power law with an index of f = 0.414 and an equivalent 
interstellar extinction of E(B— V) < 0.15, assuming dust reddening 
Figure 1 | Rest-frame γ-ray and X-ray light curves for bursts at different 
redshifts. BAT and XRT light curves of GRB 090423 (red data) in the source 
rest frame. Errors in luminosity, $L_{\mathrm{iso}}$, are at the 1σ level; horizontal bars refer 
to the integration time interval. The XRT 0.3–10-keV light curve shows a 
prominent flare at a rest-frame time of $t_{\mathrm{rf}} \sim 18$ s (also detected by BAT), and 
a flat phase (with a power-law index of $\alpha_{\mathrm{X,1}} = 0.13 \pm 0.11$) followed by a 
typical decay with a power-law index of $\alpha_{\mathrm{X,2}} = 1.3 \pm 0.1$. We compare the 
light curves of GRB 090423 with those of seven GRBs in the redshift interval 


consistent with the Small Magellanic Cloud. On the other hand, the 
analysis of the XRT data in the time interval 3,900–21,568 s suggests the 
presence of intrinsic absorption (in excess of the Galactic value) with an 
equivalent hydrogen column density of Ny(z) = 6.812$ X 1077cm * 
(90% confidence level; Supplementary Information, section 1). The 
low value of the dust extinction coupled with a relatively high value 
of $N_{\mathrm{H}}$ suggests that GRB 090423 originates from a region with low dust 
content relative to those of low-z GRBs, but one similar to that of the 
high-z GRB 050904, for which z = 6.3 (ref. 15). Because the absorbing 
medium must be thin from the point of view of ‘Thomson’ scattering, 
the metallicity of the circumburst medium can be constrained to be 
>4% of the solar value, $Z_{\odot}$. The implication is that previous supernova 
explosions have already enriched the host galaxy of GRB 090423 to 
more than the critical metallicity, $Z \sim 10^{-4} Z_{\odot}$ (ref. 16), that prevents 
the formation of very massive stars (population III stars). Therefore, 
the progenitor of GRB 090423 should belong to a second stellar 
generation. Its explosion injected fresh metals into the interstellar 
medium, further contributing to the enrichment of its host galaxy. 
Its existence empirically supports the cosmological models in which 
stars and galaxies, already enriched in metals, are in place only 
~600,000,000 yr after the Big Bang. Long GRBs are mostly associated 
with star-forming dwarf galaxies, which are thought to be the dominant 
population of galaxies in the early Universe. The fact that 
GRB 090423 appears to have exploded in an environment similar to 
that of low-z GRB hosts is in agreement with this. 

The occurrence of a GRB at z ~ 8 has important implications for 
the cosmic history of these objects. In a first, simple, approach, we 
can assume that GRBs trace the cosmic star formation history, given 
the well-known link between long GRBs and the deaths of massive 
stars, and that GRBs are well described by a universal luminosity 
function. However, under these assumptions the expected number of 
bursts at z= 8 with an observed photon peak flux larger than or equal 
to that of GRB 090423 is extremely low: ~4 X 10 * in ~4 yr of Swift 
operation (Supplementary Fig.6 and Supplementary Information, 
section 4). Hence, one or both of the above assumptions may be an 


0.8—6.3. The bursts are selected from among those showing a canonical 
three-phase behaviour (steep decay/plateau/normal decay) in the X-ray light 
curve and without a spectral break between BAT and XRT, allowing the 
spectral calibration of the BAT signal into the 0.3-10-keV energy band. The 
light curves of GRB 090423 do not have any distinguishing features relative 
to those of the lower-redshift bursts, suggesting that the physical mechanism 
that causes the GRB and its interaction with the circumburst medium are 
similar at z ~ 8.1 and at lower redshifts. 


oversimplification. The detection of a very high-z burst such as 
GRB 090423 could be accommodated if the GRB luminosity function 
were shifted towards higher luminosities according to $(1 + z)^{\delta}$ with 
Figure 2 | TNG spectrum of the near-infrared afterglow. a, Spectrum of 
GRB 090423 obtained using the Amici prism on the TNG. The sharp break at 
wavelength λ ~ 1.1 μm, which is due to H I absorption in the intergalactic 
medium at the wavelength of the Lyman-α line, implies that $z = 8.1^{+0.1}_{-0.3}$. The 
spectrum has been smoothed with a boxcar filter of width 25 pixels 
(where one pixel corresponds to ~0.006 μm at λ = 1.1 μm). The absolute 
flux calibration was obtained by matching the almost simultaneous GROND 
photometric measurements. The wavelength calibration was obtained from 
the TNG archive and adjusted to the wavelengths of the main atmospheric 
bands. The error bar corresponds to 1σ uncertainty as measured on the 
smoothed spectrum. The confidence level of the Lyman-α break detection is 
4σ. See also Supplementary Information, section 3. b, Plot of 
transmittance, T (the atmospheric transparency convolved with the 
instrumental response). The system has a significant sensitivity down to 
0.9 μm, and no instrumental or atmospheric effect could explain the abrupt 
flux break observed in the spectrum of GRB 090423. 
$\delta \gtrsim 1.5$, or if the GRB formation rate were strongly enhanced in 
galaxies with $Z < 0.2\,Z_{\odot}$. The requirement for evolution may be 
mitigated if we assume a very high star formation rate at z > 8. 
However, we note that the need for evolution is strongly supported 
by both the large number of Swift detections at z > 2.5 (ref. 24) and 
the number of bursts with peak luminosities in excess of 10°* ergs! 
(ref. 26). A possible explanation is that high-redshift galaxies are 
characterized by a top-heavy (bottom-light) stellar initial mass func- 
tion with a higher incidence of massive stars than in the local 
Universe, providing an enhanced number of GRB progenitors. 
Such objects could be the main agents responsible for completing 
the reionization of the Universe. 
Acceleration of neutral atoms in strong short-pulse laser fields 


U. Eichmann, T. Nubbemeyer, H. Rottke & W. Sandner 


A charged particle exposed to an oscillating electric field experi- 
ences a force proportional to the cycle-averaged intensity gradient. 
This so-called ponderomotive force plays a major part in a variety 
of physical situations such as Paul traps for charged particles, 
electron diffraction in strong (standing) laser fields (the 
Kapitza–Dirac effect) and laser-based particle acceleration. 
Comparably weak forces on neutral atoms in inhomogeneous light 
fields may arise from the dynamical polarization of an atom; 
these are physically similar to the cycle-averaged forces. Here we 
observe previously unconsidered extremely strong kinematic 
forces on neutral atoms in short-pulse laser fields. We identify 
the ponderomotive force on electrons as the driving mechanism, 
leading to ultrastrong acceleration of neutral atoms with a mag- 
nitude as high as ~10" times the Earth’s gravitational accelera- 
tion, g. To our knowledge, this is by far the highest observed 
acceleration on neutral atoms in external fields and may lead to 
new applications in both fundamental and applied physics. 

The investigation has become possible through two recent findings
concerning atomic ionization dynamics in strong laser fields. First,
neutral atoms can survive a strong laser field in a (long-lived) excited
state, in which they can be detected directly in an atomic beam by
means of a standard electron or ion detector. Thus, any momentum
transferred to the neutral atom can easily be detected. Second,
according to the physical picture behind the excitation process, the
excited electron behaves as a quasi-free electron during the laser
pulse. More precisely, the excitation process can be viewed as a
frustrated tunnel ionization within the three-step model for
strong-field ionization.

In the first step, the electron tunnels in the close vicinity of the
maximum electric field of a laser cycle. The liberated electron is then
driven by the laser field with an amplitude that slowly decreases with
decreasing pulse intensity; in this way an active damping of the
electronic motion takes place. After the laser pulse the electron is left with
a drift energy too low to overcome the Coulomb potential of the ion
and is recaptured into a Rydberg state. The quivering quasi-free
electron experiences the ponderomotive force during the laser pulse
owing to the intensity gradient in the focused laser beam. We will
show here that the quiver motion of the electron is partially converted
into centre-of-mass motion of the neutral atom, leading to a
substantial acceleration. This results in a measurable momentum
transfer to the atom despite the short interaction time in the femtosecond
range. Remarkably, the ponderomotive effect is typically estimated to
be negligible for these conditions, with, however, a few exceptions.
We note that the investigation relies on the highly selective
process of excitation of neutrals in a strong laser field, where
kinematic effects are imparted only through the gradient of the laser
field.

In the experiment we excite neutral He atoms in an effusive atomic
beam using a perpendicularly intersecting focused laser beam. Using
the detection technique (see the Methods) we measure the distribution
of excited He atoms on a detector as shown in Fig. 1. If, during
the laser pulse, no momentum is transferred to the atoms, we would
expect a slightly enlarged projected image of the (laser-intensity-dependent)
distribution of excited atoms in the laser beam on the
detector, that is, a distribution that extends along the laser beam
direction (z axis), typically within the Rayleigh length, but with a
very narrow radial distribution (r_D axis) of the order of the size of
the laser beam waist.

In Fig. 1a, however, we see a strikingly large radial distribution of
excited atoms with a strong maximum in the laser focal plane (z = 0)
that obviously stems from a deflecting radial force during the laser
pulse. In Fig. 1b the cut along the z axis (black curve) shows two
maxima at roughly half the laser peak intensity I_0/2, where the net
production rate of excited helium atoms He* is apparently maximum,
whereas the He* signal at I_0 shows a pronounced minimum.
However, the loss of neutral excited atoms is largely due to their
radial deflection. The full projection (red dashed curve) shows only
a slight decrease in signal, indicating that even at the highest
intensities He atoms are excited. The data are taken at a low beam target
pressure of ~5 × 10⁻⁷ mbar. The radial deflection is unchanged
when we increase the target pressure by more than a factor of 30.
This excludes many-particle effects based on atom density or space
charge as an origin of our observations. Furthermore, we emphasize
that the radial distribution is unaltered whether the linear
polarization of the laser beam is in the direction of the atomic beam or
perpendicular to it. In this respect the intensity-dependent force very
much resembles the ponderomotive force acting on charged particles.
The question arises whether we can conclude that the
ponderomotive force is responsible for the observed centre-of-mass motion
of the neutral particle.

To shed light on the underlying process we first recall that the
ponderomotive force F_p on a charged particle is given by (all
equations are in atomic units):

$$F_p = -\frac{q^2}{4 m \omega^2}\,\nabla |E_0|^2 \qquad (1)$$

Here, m and q are the mass and the charge of the particle, respectively,
E(r, t) = E_0(r, t) exp(iωt) is the electric field, ω is the field frequency
and E_0(r, t) is the slowly varying field amplitude. Hence, in view of
our frustrated tunnel ionization model, both the ionic core and the
electron experience a mass-dependent ponderomotive force during
the laser pulse. As a consequence of the mass dependency, however,
the ionic core remains practically unaffected while the electron
experiences a non-negligible ponderomotive force. This, in turn,
means that to the first approximation a ponderomotive force acts
directly on the centre-of-mass motion of the atom and leaves the
recapturing process unaffected. This can be shown more rigorously:
we derive the centre-of-mass motion from the Lorentz force (see the
online-only Methods). Recalling the key point of our investigation,
that the electron remains bound after interaction with the laser pulse,
we are able to observe the centre-of-mass motion. Furthermore, by
solving the coupled Lorentz equations for the electron and the ion,
including the Coulomb potential, we can directly reproduce the
capture process into bound Rydberg orbits and the force on the centre of
mass.

Figure 1 | Deflection of neutral He atoms after interaction with a focused
laser beam. a, Distribution of excited He* atoms on the detector (colour
scale, in number of atoms). The laser beam direction is indicated by the
arrow. b, Cut through the atom distribution along the laser beam axis (z axis)
at r_D = 0 mm (black curve) and full projection on the z axis (dashed red curve)
and intensity along the z axis in units of the laser peak intensity
I_0 = 6.9 × 10¹⁵ W cm⁻² (blue curve). c, Cuts through the distribution at
z = 0 mm (red curve) and z = −2.7 mm (black curve). The black curve shows
the velocity distribution of excited neutral atoms at a position unaffected by
the ponderomotive force, showing essentially the ‘natural’ velocity spread,
while the red curve shows the velocity gain through the ponderomotive
force.

According to our model we can rewrite equation (1) for the centre-of-mass
position R of the neutral atom:

$$M\ddot{R}(t) = -\frac{1}{4 m_e \omega^2}\,\nabla |E_0|^2 \qquad (2)$$

Here, M and m_e (= 1 atomic unit) are the masses of the atom and the
electron, respectively, and R̈(t) is the second derivative of the centre-of-mass
position R with respect to time. To calculate the ponderomotive
force explicitly, we assume a linearly polarized laser beam
with a Gaussian spatial intensity distribution, which reads, in
cylindrical coordinates:

$$I(r) = |E_0(r)|^2 = I_0 \left(\frac{w_0}{r_0}\right)^2 \exp\!\left(-\frac{2r^2}{r_0^2}\right) \qquad (3)$$

where r_0 = w_0 √(1 + (z/z_0)²), w_0 is the beam waist and z_0 is the
Rayleigh length. Evaluating the
gradient in equation (2) with the intensity distribution given by
equation (3), we obtain, for the radial component of the centre-of-mass
position perpendicular to the laser beam direction:

$$\ddot{r}(t) = \frac{I_0 w_0^2}{M m_e \omega^2 r_0^4}\; r \exp\!\left(-\frac{2r^2}{r_0^2}\right) f(t) \qquad (4)$$

where f(t) is the laser pulse envelope, which we assume to be of the
form f(t) = exp(−t²/τ²), where τ is the pulse width. From equation (4)
we find that the maximum force along the radial direction scales as
r_0⁻¹. Similarly, one can show that it scales as z_0⁻¹ along the laser beam
direction. Because the Rayleigh length z_0 is typically a factor of 100
larger than the beam waist r_0, the gradient and thus the ponderomotive
force in the laser beam direction is much smaller than in the
radial direction and can be neglected. (However, the situation would
be very different if we used a short-pulse standing-wave laser field.
We would then obtain a strong periodic intensity gradient on the
scale of the laser wavelength, and might expect to see the Kapitza–Dirac
effect for neutral atoms in an intense standing-wave laser field
instead of electrons).
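
The size of the Rayleigh length relative to the beam waist can be checked with the standard Gaussian-beam relation z_0 = π w_0²/λ. The sketch below is only illustrative: the 800 nm wavelength is an assumption (a common Ti:sapphire value that the text does not state), the 16 μm waist is the fitted value quoted in Fig. 2, and the function names are hypothetical.

```python
import math


def rayleigh_length(w0_m, wavelength_m):
    """Rayleigh length z_0 = pi * w_0^2 / lambda of an ideal Gaussian beam."""
    return math.pi * w0_m ** 2 / wavelength_m


def beam_radius(z_m, w0_m, wavelength_m):
    """Beam radius r_0(z) = w_0 * sqrt(1 + (z / z_0)^2)."""
    z0 = rayleigh_length(w0_m, wavelength_m)
    return w0_m * math.sqrt(1.0 + (z_m / z0) ** 2)


W0 = 16e-6           # fitted beam waist quoted in Fig. 2, in metres
WAVELENGTH = 800e-9  # ASSUMED wavelength; not given in the text

z0 = rayleigh_length(W0, WAVELENGTH)
print(f"z_0 = {z0 * 1e3:.2f} mm, z_0 / w_0 = {z0 / W0:.0f}")
```

Under these assumptions the Rayleigh length is several tens of times the waist, so the axial intensity gradient is correspondingly weaker than the radial one.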

To solve equation (4), we assume that the neutral atom does not
move significantly during the laser pulse. Hence, we set r(t) = r on the
right-hand side of the equation, which allows us to solve equation (4)
analytically for any initial position of an atom in the laser beam. We
will concentrate our analysis on atoms located at the half beam size
r_0/2, which experience the maximum force. Solving equation (4) for
these conditions by integrating over the full laser pulse, we find the
maximum velocity v_max(z):

$$v_{\max}(z) = \frac{\sqrt{\pi}\,\exp(-0.5)\, I_0\, \tau}{2 M m_e \omega^2 w_0} \left[1 + \left(\frac{z}{z_0}\right)^2\right]^{-3/2} \qquad (5)$$

If we evaluate equation (5) at the focal plane for He atoms exposed to
our focused laser beam at maximum intensity, we obtain a velocity of
about 55 m s⁻¹ from which accelerations of about 2 × 10¹⁴ g can be
deduced.

This exceeds the typical acceleration (deceleration) of neutral
atoms or molecules in external fields. Compared to laser-cooling
experiments in a continuous-wave laser field, for instance, which are
based on photon momentum transfer with a typical deceleration of
about 10°¢ (ref. 20), the acceleration present in our experiment is eight 
orders of magnitude larger. It is, to the best of our knowledge, by far 
the largest acceleration of neutral matter by electromagnetic fields ever 
observed. 
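
Equation (5) is straightforward to evaluate numerically. The following is a minimal sketch, not the authors' analysis code: it assumes a wavelength of 800 nm (not stated in the text), assumes that the quoted FWHM pulse duration refers to the envelope f(t) = exp(−t²/τ²), and uses rounded conversion factors to atomic units with m_e = 1. The function name `v_max` and its arguments are hypothetical.

```python
import math

# Rounded conversion factors to atomic units.
INTENSITY_AU_W_CM2 = 3.509e16   # intensity for which |E_0| = 1 a.u.
TIME_AU_S = 2.4189e-17
LENGTH_AU_M = 5.2918e-11
VELOCITY_AU_M_S = 2.1877e6
AMU_IN_ELECTRON_MASSES = 1822.89
HARTREE_NM = 45.5634            # photon energy [a.u.] = HARTREE_NM / wavelength [nm]


def v_max(z_over_z0, i0_w_cm2, fwhm_fs, w0_um, mass_amu, wavelength_nm=800.0):
    """Evaluate equation (5) (with m_e = 1 a.u.); returns the velocity in m/s."""
    i0 = i0_w_cm2 / INTENSITY_AU_W_CM2
    # ASSUMPTION: the quoted FWHM is that of f(t) = exp(-t^2 / tau^2).
    tau = fwhm_fs * 1e-15 / (2.0 * math.sqrt(math.log(2.0))) / TIME_AU_S
    w0 = w0_um * 1e-6 / LENGTH_AU_M
    mass = mass_amu * AMU_IN_ELECTRON_MASSES
    omega = HARTREE_NM / wavelength_nm  # ASSUMED 800 nm unless overridden
    prefactor = (math.sqrt(math.pi) * math.exp(-0.5) * i0 * tau
                 / (2.0 * mass * omega ** 2 * w0))
    return prefactor * (1.0 + z_over_z0 ** 2) ** -1.5 * VELOCITY_AU_M_S


# Parameters of Fig. 2a: I_0 = 2.8e15 W cm^-2, 40 fs, fitted w_0 = 16 um.
v_he = v_max(0.0, 2.8e15, 40.0, 16.0, mass_amu=4.0026)
v_ne = v_max(0.0, 2.8e15, 40.0, 16.0, mass_amu=20.180)
print(f"He: {v_he:.1f} m/s, Ne: {v_ne:.1f} m/s")
```

With these inputs the predicted He velocity at the focal plane is of the order of ten metres per second for the shortest pulse. It grows linearly with pulse duration at fixed intensity and falls off as [1 + (z/z_0)²]^(−3/2) away from the focus, as equation (5) requires.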

Finally, we mention that a different but equivalent description of
our process might be given in terms of an atom in the Kramers–Henneberger
reference frame, which is expected to exhibit stable
configurations in a strong electromagnetic field. Our observations of
accelerated neutral atoms seem to be a direct confirmation of the
existence of this exotic type of stable atom.

In Fig. 2 we show the results of a systematic investigation of He
atoms exposed to laser light of different laser pulse lengths and laser
pulse energy E_p, such that the laser intensity is kept constant. We
obtain neutral excited atom distributions similar to the one shown in
Fig. 1. The maximum radial deflection of the atoms along the laser
beam axis, which can be converted into maximum velocity v_max(z),
can reliably be extracted from the experimental data to compare with
equation (5). Fitting equation (5) to the data gives very good agreement,
confirming the validity of our model. The fit contains only
three free parameters: a small velocity offset, a common constant
beam waist w_0, and a scaling factor for the absolute velocity. The best
choice of the latter implies that the acceleration must last over the
whole pulse duration, confirming the assumption that the electrons
are set free early during the laser pulse.

Figure 2 | Maximum velocity v_max(z) gained by neutral He atoms.
a–e, v_max(z) extracted from measurements at constant laser peak intensity
I_0 = 2.8 × 10¹⁵ W cm⁻² (3.4 × 10¹⁵ W cm⁻²), but for different pulse energies
E_p and pulse durations τ_FWHM: a, 600 μJ, 40 fs; b, 900 μJ, 60 fs; c, 1.2 mJ, 80 fs;
d, 1.5 mJ, 100 fs; e, 1.8 mJ, 120 fs. FWHM, full-width at half-maximum. The
red curves are fits to the data based on equation (5). The fitted common
beam waist w_0 = 16.0 μm is in good agreement with an independent
experimental determination of w_0 = 17.5 ± 1.5 μm using the knife-edge
method. It results, however, in a higher laser peak intensity, which is
indicated in parentheses. f, v_max(z) for laser parameters E_p = 1.8 mJ and
τ_FWHM = 40 fs corresponding to I_0 = 8.3 × 10¹⁵ W cm⁻² (10 × 10¹⁵ W cm⁻²).
The error bars are estimated from the accuracy of the maximum deflection
we determined.

Even then the observed velocities lie slightly above the theoretical 
prediction (see Fig. 3), which we attribute to absolute laser intensity 
uncertainties or a slightly non-Gaussian intensity distribution. 
Hence, we note that the radial velocity v_max(z) might serve as a
valuable beam shape and intensity parameter along the laser beam 
axis. On the one hand it measures the peak intensity if the radial 
intensity distribution is known (for example, Gaussian), or, on the 
other hand, indicates systematic deviations from the Gaussian beam 
shape if the peak intensity is known (for example, from ionization 
experiments). Such information is otherwise rather difficult to 
obtain. 

We have also measured the radial deflection of a beam of Ne atoms,
which is, however, smaller than in He, as expected owing to the larger
mass. In Fig. 3 we show the maximum velocity v_max(z = 0) transferred
to Ne atoms exposed to a laser beam kept at a constant laser
intensity of I_0 = 2.8 × 10¹⁵ W cm⁻², but at different pulse durations
and energies. For comparison, the maximum velocities for He are
also shown. The data clearly show the excellent quantitative agreement
of the observed deflection with our theoretical predictions.
Within the range of our achievable pulse durations the deflection is
inversely proportional to the mass and proportional to the time the
force is acting on the atom.

In conclusion, we have demonstrated ultrastrong acceleration of
neutral atoms, using femtosecond laser pulses with intensities of up
to I_0 = 10¹⁶ W cm⁻². We present a quantitative theoretical model
based on the concept that a bound electron temporarily undergoes 
a quasi-free oscillatory quiver motion during the laser pulse while still 
being coupled to the ion by the Coulomb force. The inhomogeneous 
field of a focused laser beam causes a ponderomotive force on the 
electron, resulting in a centre-of-mass acceleration of the whole 
atom. Many new applications may be envisioned, considering that 
the mechanism transfers the momenta of a large number (presently 
of the order of 10°) of photons quasi-instantaneously onto neutral 
atoms, without any problems due to Doppler detuning, as in the case 
of resonant continuous-wave laser absorption. Atom optics, con- 
trolled atom deposition or controlled chemical reactions are just 
some of them. The acceleration process may be considerably refined 
by sophisticated spectral and temporal shaping of the accelerating 
laser beam, including the use of standing (or slowly moving) waves 
with much steeper field gradients, or the use of longer pulse durations 
for achieving substantially larger atomic momenta. 

Figure 3 | Maximum velocity v_max(0) transferred to He and Ne at the focal
plane as a function of the laser pulse duration at constant laser intensity.
v_max(0) is plotted for He (black squares) and Ne (red circles) measured at
I_0 = 2.8 × 10¹⁵ W cm⁻² (3.4 × 10¹⁵ W cm⁻²); see explanation in Fig. 2. The
dashed black and red curves show the calculated velocities, respectively,
using equation (5) with the fitted beam waist, but omitting the scaling factor.
The full curves are fits to the data. The error bars are determined as in Fig. 2.

METHODS SUMMARY


The observation of acceleration of neutral atoms in strong laser fields is based on 
the method of producing and detecting excited neutral atoms in strong laser 
fields introduced in ref. 14. In the online-only Methods we give a detailed 
description of how to adapt the method to the present experiments. 
Furthermore, we detail the theoretical background of the underlying process 
and derive the respective formulas used to describe our data. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 15 May; accepted 27 August 2009. 

METHODS
Experimental set-up and detection of excited neutral atoms. We use a slightly
modified version of an experimental set-up that has been described elsewhere.
Briefly, a linearly polarized Ti:sapphire laser beam with a repetition rate of
500–1,000 Hz, a variable pulse width of τ_FWHM = 40–120 fs and pulse energies up to
2.5 mJ is steered into a vacuum chamber with a base pressure of ~10-? mbar. It 
is focused by means of a plano-convex lens with a focal length of 0.2m into an 
effusive beam of He or Ne atoms at right angles, where, among other processes, it 
excites neutral atoms. The beam target pressure is typically <5 × 10⁻⁷ mbar. A position-sensitive
microchannel plate detector is located 0.38 m downstream in the direction of the atomic
beam. The detector can be configured to detect either ions or
electrons in the counting mode. Both configurations are suitable for excited neutral 
species detection, if they have an excitation energy of more than ~5 eV. In contrast 
to charged particles, which are easily accelerated by small electric fields (either 
deliberately applied or stemming from spurious charges) and hit the detector after 
a few microseconds, neutral atoms travel at thermal velocities reaching the detector 
only after hundreds of microseconds. Consequently, the excited state needs to 
survive long enough that the atom can reach the detector in an excited state. It 
has been found in earlier work that an intense laser interacting with a beam of rare
gas atoms produces mainly Rydberg states with principal quantum numbers around 
n= 10, which decay partially to a metastable state with high excitation energy, for 
example, ~20 eV in He. The metastable state lives long enough to reach the detector. 
We estimate that about 1% of all excited atoms reach the detector in an excited state.
Owing to the divergence of the atomic beam, the signal on the detector in laser beam 
direction is an image enlarged by a factor of 2.3 of the distribution of excited atoms in 
the laser beam, in case the momentum transferred to the atom during the excitation 
can be neglected. This needs to be taken into account when transforming the detector 
signal along the z_D coordinate into the laser beam z coordinate. The high detection
sensitivity to accelerating forces during the laser pulse lies in the nature of neutral 
excited atoms travelling with thermal velocities. The time-of-flight distribution of He 
atoms at the microchannel plate detector peaks around 230 μs. Because momentum is
only transferred to the atoms during the laser pulse duration, the starting point of the
atoms is well defined. By setting a 40-μs-wide gate on the maximum of the time-of-flight
distribution, the radial velocity can be determined precisely from
the radial beam deflection on the detector. We note that the centre-of-mass motion of
a neutral atom is largely unaffected by spurious electric fields or space charge. 
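
The conversion between detector deflection and radial velocity described above amounts to a ballistic time-of-flight relation. The sketch below illustrates it with the numbers quoted in this section (0.38 m drift length, 230 μs time-of-flight peak, 40 μs gate) and the 55 m s⁻¹ velocity from the main text. It assumes force-free straight-line flight after the laser pulse and neglects the radial starting position within the focus; the function names are hypothetical.

```python
def radial_velocity(deflection_m, time_of_flight_s):
    """Radial velocity of an atom that drifts force-free after the laser pulse."""
    return deflection_m / time_of_flight_s


def deflection(radial_velocity_m_s, time_of_flight_s):
    """Radial displacement on the detector for a given radial velocity."""
    return radial_velocity_m_s * time_of_flight_s


DRIFT_LENGTH = 0.38   # m, distance from laser focus to detector
TOF_PEAK = 230e-6     # s, peak of the He time-of-flight distribution
GATE_WIDTH = 40e-6    # s, width of the time-of-flight gate

beam_speed = DRIFT_LENGTH / TOF_PEAK           # longitudinal (thermal) speed
shift = deflection(55.0, TOF_PEAK)             # 55 m/s taken from the main text
relative_spread = (GATE_WIDTH / 2) / TOF_PEAK  # half gate width over peak time

print(f"beam speed ~ {beam_speed:.0f} m/s")
print(f"deflection for 55 m/s ~ {shift * 1e3:.1f} mm")
print(f"velocity uncertainty from the gate ~ +/-{relative_spread:.0%}")
```

Because the flight time is long, a radial velocity of a few tens of metres per second translates into a displacement of order a centimetre on the detector, which is why the method is sensitive to small momentum transfers.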
Derivation of the ponderomotive centre-of-mass force on neutral atoms. To
derive the centre-of-mass force on the neutral atom we solve the coupled Lorentz
equations for the ion with mass m_1 and charge q_1 and electron with mass m_2 and
charge q_2 including the Coulomb interaction between them. The two Lorentz
equations in atomic units read:

$$F_1 = m_1 \ddot{r}_1 = q_1 (E_1 + v_1 \times B_1) + F_c \qquad (6)$$

$$F_2 = m_2 \ddot{r}_2 = q_2 (E_2 + v_2 \times B_2) - F_c \qquad (7)$$

where E_i = E(r_i, t) and B_i = B(r_i, t). The Coulomb force F_c is given by
F_c = q_1 q_2 (r_1 − r_2)/|r_1 − r_2|³. Using the first-order electric and magnetic fields for a focused
Gaussian beam, we can write the equations explicitly:

$$\begin{aligned}
m_1 \ddot{x}_1 &= E_{1x} + \dot{y}_1 B_{1z} - \dot{z}_1 B_{1y} + F_{cx} \\
m_1 \ddot{y}_1 &= -\dot{x}_1 B_{1z} + F_{cy} \\
m_1 \ddot{z}_1 &= \dot{x}_1 B_{1y} + F_{cz} + E_{1z}
\end{aligned} \qquad (8)$$

$$\begin{aligned}
m_2 \ddot{x}_2 &= -E_{2x} - \dot{y}_2 B_{2z} + \dot{z}_2 B_{2y} - F_{cx} \\
m_2 \ddot{y}_2 &= \dot{x}_2 B_{2z} - F_{cy} \\
m_2 \ddot{z}_2 &= -\dot{x}_2 B_{2y} - F_{cz} - E_{2z}
\end{aligned} \qquad (9)$$


Equation (2) can be derived analytically from the Lorentz forces using centre- 
of-mass and relative coordinates. We obtain the force on the centre-of-mass by 
using the following approximations. The fields are approximated to first order 
and we assume the slowly varying amplitude approximation; the relative velocity 
and position are approximated through the motion of the electron in the electric 
field in polarization (x) direction, neglecting the Coulomb force. 

By solving equations (8) and (9) numerically without the aforementioned 
approximations, we obtain, for specific initial conditions, the bound trajectories 
of the tunnel electron at the end of the laser pulse as described in ref. 14. Most 
importantly, by evaluating the centre-of-mass velocity from the numerical cal- 
culations we confirm the results of equation (2). 

To give an interpretation of why the electron remains bound, we first note 
that its motion during the laser pulse is mainly driven by the laser field. If we 
suppose that the electron starts at a maximum of a field cycle close to the 
nucleus, that is, where the Coulomb field is strong, the electron then revisits 
its starting (inner turning) point periodically with the laser frequency. Here, the 
attractive force is strongly peaked. For example, if the electron is set free at a 
distance of 10 atomic units (a.u.) from the nucleus, the Coulomb force is
10⁻² a.u. acting for roughly a tenth of the laser cycle duration. If one compares
this periodical force with the steady ponderomotive force F_p, which is of the
order of 10⁻⁵ a.u. for I = 0.1 a.u. and beam waist of w_0 = 16 μm, we find an
attractive net force.
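
The two orders of magnitude compared above can be reproduced in a few lines. This is a rough sketch under stated assumptions: the ponderomotive force is taken at its radial maximum in the focal plane (r = w_0/2), which follows from equations (1) and (3) with q = m = 1 a.u., and a wavelength of 800 nm is assumed because the text does not give one. Function names are hypothetical.

```python
import math

LENGTH_AU_M = 5.2918e-11  # Bohr radius in metres
HARTREE_NM = 45.5634      # photon energy [a.u.] = HARTREE_NM / wavelength [nm]


def coulomb_force(distance_au):
    """Magnitude of the Coulomb force between two unit charges, in a.u."""
    return 1.0 / distance_au ** 2


def peak_ponderomotive_force(i0_au, w0_um, wavelength_nm=800.0):
    """Largest radial ponderomotive force on an electron in the focal plane, in a.u.

    From equations (1) and (3) at r = w_0 / 2:
    F = exp(-0.5) * I_0 / (2 * omega^2 * w_0).
    """
    omega = HARTREE_NM / wavelength_nm  # ASSUMED 800 nm unless overridden
    w0 = w0_um * 1e-6 / LENGTH_AU_M
    return math.exp(-0.5) * i0_au / (2.0 * omega ** 2 * w0)


f_coulomb = coulomb_force(10.0)
f_pond = peak_ponderomotive_force(0.1, 16.0)
print(f"Coulomb: {f_coulomb:.1e} a.u., ponderomotive: {f_pond:.1e} a.u.")
```

Even when the Coulomb force is weighted by the roughly one-tenth of the cycle during which it acts, it remains well above the steady ponderomotive force in this estimate, consistent with the electron staying bound.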

Sign of the centre-of-mass force on a neutral atom in a laser field. A focus of a 
continuous-wave laser beam can act as a quasi-static dipole trap for neutral 
atoms provided the photon energy of the laser is much below the lowest excited
state. In this case the atom is a strong-field seeker. In our experiment, the force on 
the centre-of-mass motion according to equation (4) is outward bound, that is, 
the atom in the strong short-pulsed laser field is a low-field seeker, although the 
photon energy of the laser field is much below the lowest excited state. Obviously, 
the effective polarizability of an atom in a strong laser field, where the electron 
can be treated as quasi-free, differs in sign from a ground-state atom in a weak 
continuous laser field, where the strongly bound electron is only weakly 
perturbed by the laser field. According to the analysis of ref. 25 there is a
relationship between the dynamical polarizability of an atom and the
ponderomotive potential for a free electron U_p, from which the ponderomotive force can be
obtained by F_p = −∇U_p. For a quasi-free electron, as is the case for our
experiments, U_p = q²E²/(4mω²). The more general ponderomotive potential for a bound
electron with a binding frequency ω_0 is given by U_p = q²E²/[4m(ω² − ω_0²)], which can
be derived from the Lorentz model of atomic polarizability. It is immediately
obvious that for a strongly bound electron with ω ≪ ω_0, the sign of the
ponderomotive potential is opposite and the absolute value is much smaller than for a
free electron.
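
The sign change can be made explicit by taking the two limits of the bound-electron potential. The lines below are a short worked check, assuming the Lorentz-model form U_p = q²E²/[4m(ω² − ω_0²)] referred to in the text; the superscripts "bound" and "free" are labels introduced here for clarity.

```latex
\begin{align}
U_p^{\mathrm{bound}} &= \frac{q^2 E^2}{4m\,(\omega^2 - \omega_0^2)} \\
\omega_0 \to 0:\quad U_p^{\mathrm{bound}} &\to \frac{q^2 E^2}{4m\,\omega^2}
  = U_p^{\mathrm{free}} > 0 \\
\omega \ll \omega_0:\quad U_p^{\mathrm{bound}} &\approx -\frac{q^2 E^2}{4m\,\omega_0^2}
  = -\left(\frac{\omega}{\omega_0}\right)^{2} U_p^{\mathrm{free}} < 0
\end{align}
```

A positive potential that peaks at the focus pushes the particle outward (low-field seeker), whereas the negative potential of the strongly bound case draws it inward (strong-field seeker), with a magnitude reduced by the factor (ω/ω_0)².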

Preserving electron spin coherence in solids by optimal dynamical decoupling

Jiangfeng Du, Xing Rong, Nan Zhao, Ya Wang, Jiahui Yang & R. B. Liu


To exploit the quantum coherence of electron spins in solids in
future technologies such as quantum computing, it is first vital
to overcome the problem of spin decoherence due to their coupling
to the noisy environment. Dynamical decoupling, which uses
stroboscopic spin flips to give an average coupling to the environment
that is effectively zero, is a particularly promising strategy for
combating decoherence because it can be naturally integrated with
other desired functionalities, such as quantum gates. Errors are
inevitably introduced in each spin flip, so it is desirable to minimize
the number of control pulses used to realize dynamical decoupling
having a given level of precision. Such optimal dynamical decoupling
sequences have recently been explored. The experimental
realization of optimal dynamical decoupling in solid-state systems,
however, remains elusive. Here we use pulsed electron paramagnetic
resonance to demonstrate experimentally optimal dynamical
decoupling for preserving electron spin coherence in irradiated
malonic acid crystals at temperatures from 50 K to room temperature.
Using a seven-pulse optimal dynamical decoupling sequence,
we prolonged the spin coherence time to about 30 μs; it would
otherwise be about 0.04 μs without control or 6.2 μs under one-pulse
control. By comparing experiments with microscopic theories, we
have identified the relevant electron spin decoherence mechanisms
in the solid. Optimal dynamical decoupling may be applied to
other solid-state systems, such as diamonds with nitrogen-vacancy
centres, and so lay the foundation for quantum coherence control
of spins in solids at room temperature.

The idea of dynamical decoupling originated in spin echo (refs 16, 17) in
nuclear magnetic resonance, in which a single flip of nuclear spins by
a radio pulse makes the noises ‘felt’ by the spins before and after the
pulse cancel each other, at least partially. Recently, progress has been
made on electron spin echo in solid-state systems, such as quantum dots
and impurities. The standard periodic Carr–Purcell–Meiboom–Gill
(CPMG) pulse sequences can keep the spin coherence longer by
decoupling a spin from its environment. Iteratively concatenated
sequences can realize the dynamical decoupling to an arbitrarily high
order of precision, but the number of pulses increases exponentially
with the controlling order. A remarkable advance in dynamical decoupling
theory is the discovery by Uhrig of optimal sequences, which
realize the Nth order of dynamical decoupling precision with the minimum
number (N) of spin-flip pulses. The Uhrig dynamical decoupling
(UDD) was first discovered for a specific spin-boson model, and was
later conjectured and rigorously proved to be model-independent.
The first experimental study of UDD was done with trapped ions with
the noise mimicked by modulating the controlling system. Here we
report the experimental demonstration of UDD for electron spins in
solids, namely, radical electron spins in γ-irradiated malonic acid single
crystals, from 50 K to room temperature. Our results demonstrate
that the UDD performs better than the periodic CPMG dynamical
decoupling (PDD). The experimental data from different samples
under various conditions are in good agreement with calculations based
on microscopic theories (with only one fitting parameter, fixed at the
same value for all samples under all conditions). We demonstrate the
strengths as well as some limitations of the dynamical decoupling in
forestalling various decoherence mechanisms relevant in the solids.

A general spin-environment interaction is described by a
Hamiltonian as

$$H = \gamma_e (B_0 + \delta B)\cdot\sigma/2 + H_E \qquad (1)$$

where γ_e is the electron gyromagnetic ratio, σ indicates the Pauli
matrices (σ_x, σ_y, σ_z) representing the quantized spin moment, B_0 is
the external magnetic field, δB is the fluctuation part of the local
magnetic field due to coupling to the environment, and H_E contains
many-body interactions within the environment. δB contains a static
fluctuation resulting from thermal distribution of environmental
states, usually referred to as inhomogeneous broadening. Also, a
dynamical quantum fluctuation will be built up, since an environmental
state with a certain eigenvalue of δB, driven by the many-body
interaction H_E, will evolve into a superposition of different local-field
eigenstates. The spin precesses about the local magnetic field
B_0 + δB, and the fluctuation δB leads to the decay of the spin
polarization ⟨σ⟩ that quantifies the quantum coherence.

Dynamical decoupling aims to reduce the effective random field by
flipping the spin from time to time. For example, by applying
instantaneous π-rotations at t_1, t_2, ..., t_k along the directions n_1, n_2, ..., n_k
respectively, the time-averaged effective Hamiltonian H_eff is given by

$$e^{-iH_{\mathrm{eff}}t} = (-i)^k\, e^{-iH(t-t_k)}\,\sigma_k \cdots \sigma_2\, e^{-iH(t_2-t_1)}\,\sigma_1\, e^{-iHt_1}$$

with σ_j ≡ n_j·σ, which flips the spin along n_j. The target is to eliminate the fluctuating
field to a given order of t, that is, H_eff = H_E + O(t^(N+1)). As errors are
inherently introduced by the controlling pulses, we should ideally use
the minimum number of pulses for a given order of decoupling
precision. For a pure dephasing model in which the random field is
fixed along a certain direction, say, the z axis, the UDD contains N
spin-flip pulses applied at

$$t_j = t \sin^2\!\left[\frac{j\pi}{2(N+1)}\right], \qquad j = 1, 2, \ldots, N \qquad (2)$$

to eliminate the coupling up to O(t^(N+1)) (see Fig. 1c). For comparison,
the PDD has the periodic timing t_j = t(2j − 1)/(2N). Below we
abbreviate the N-pulse UDD and PDD as UDDN and PDDN,
respectively. Note that UDD1 and UDD2 are identical to the standard
Hahn echo (PDD1) and Carr–Purcell control (PDD2), respectively.
But the higher order Uhrig sequences become non-periodic.
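
The two pulse timings are easy to tabulate. The following sketch evaluates the UDD timing of equation (2), t_j = t sin²[jπ/(2(N + 1))], and the periodic timing t_j = t(2j − 1)/(2N) for a unit total time; the function names are hypothetical and the code is meant only to illustrate the formulas.

```python
import math


def udd_times(n_pulses, total_time):
    """Uhrig pulse times t_j = t * sin^2(j * pi / (2N + 2)), equation (2)."""
    return [total_time * math.sin(j * math.pi / (2 * n_pulses + 2)) ** 2
            for j in range(1, n_pulses + 1)]


def pdd_times(n_pulses, total_time):
    """Periodic (CPMG-type) pulse times t_j = t * (2j - 1) / (2N)."""
    return [total_time * (2 * j - 1) / (2 * n_pulses)
            for j in range(1, n_pulses + 1)]


for n in (1, 2, 3, 7):
    print(n,
          [round(x, 4) for x in udd_times(n, 1.0)],
          [round(x, 4) for x in pdd_times(n, 1.0)])
```

For N = 1 and N = 2 the two lists coincide, as stated in the text. From N = 3 onwards the Uhrig pulses crowd towards the beginning and end of the interval while remaining symmetric about its midpoint.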

Figure 1 | System and methods for the dynamical decoupling experiments.
a, A unit cell of a malonic acid crystal contains two (CH₂)(COOH)₂
molecules. The C–H bond of a radiation-generated radical and the magnetic
field B_0 are in the carbon plane and form an angle of about 2π/3. b, Energy levels
of the electron and proton spins in a radical. The arrow indicates the
transition used in the pulsed EPR. c, Pulse sequences for UDD1–7 and
PDD7. d, First-derivative CW EPR spectrum measured at 9.722 GHz.

To test the dynamical decoupling for spins in solids, we took
γ-irradiated malonic acid single crystals as the benchmark systems. A
schematic of a unit cell of the crystal is shown in Fig. 1a. The γ-irradiation
may remove one hydrogen atom from a methylene group (–CH₂) and
create a radical (–CH). The spin-1/2 of the unpaired carbon valence
electron is the object under consideration (referred to as qubit hereafter 
for clarification). We estimated the radical concentrations of the three 
single crystals used in the experiments at 32, 8 and 0.6 p.p.m. (see
Supplementary Information for methods). Individually accessing single 
qubits is not necessary for testing dynamical decoupling, and we instead 
addressed ensembles of electron spins for enhanced signals. An addi- 
tional advantage of using spin ensembles is that dynamical decoupling 
can be tested on unwanted couplings between qubits, an issue pertinent 
to large-scale quantum computing. Figure 1d shows a typical electron 
spin resonance in the continuous wave (CW) electron paramagnetic 
resonance (EPR) spectrum. The resonance is split owing to the hyperfine 
coupling (Fig. 1b) with the spin of the α-proton, the hydrogen nucleus
in the radical. The crystals are oriented such that the hyperfine inter- 
action with the o-proton is Hc_y~2n(AP]j+Ap Ip )o2/2, with 
Aj = —45 MHz, and Aj° = 26 MHz (see Supplementary Information 
for details). Here the electron spin flip terms associated with σ_x,y have
been dropped as the Zeeman energy γ_e B_0 is much greater than the
hyperfine interaction strength A. Such coherent electron–nuclear spin
coupling has been studied for the purpose of quantum information 
processing”. 

In the dynamical decoupling experiments, the spin precession was 
initiated from thermal equilibrium by a π/2 pulse. Then N spin-flip 
pulses in the UDD or PDD timing were applied to protect the 
spin coherence. The area of the echo at time t was measured as the spin 
coherence. 
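
The pulse timings themselves are not written out in this section. As a minimal sketch, the snippet below assumes the standard Uhrig formula for UDD, t_j = T sin²(jπ/(2N + 2)), and uses an equally spaced, CPMG-style sequence as a stand-in for the periodic case. The function names and the periodic convention are illustrative assumptions, not the authors' stated definitions.

```python
import math


def udd_times(n_pulses, total_time):
    """Uhrig timing (assumed): t_j = T * sin^2(j*pi / (2N + 2)), j = 1..N."""
    return [
        total_time * math.sin(j * math.pi / (2 * n_pulses + 2)) ** 2
        for j in range(1, n_pulses + 1)
    ]


def periodic_times(n_pulses, total_time):
    """CPMG-style periodic timing (assumed): t_j = T * (2j - 1) / (2N)."""
    return [
        total_time * (2 * j - 1) / (2 * n_pulses)
        for j in range(1, n_pulses + 1)
    ]


if __name__ == "__main__":
    # Compare the two timings for a 7-pulse sequence over a unit interval.
    for name, fn in (("UDD7", udd_times), ("periodic7", periodic_times)):
        print(name, [round(t, 4) for t in fn(7, 1.0)])
```

Under these assumptions the two sequences coincide for one and two pulses and differ from three pulses onwards, with the UDD pulses crowding towards the start and end of the interval.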

Echo signals in samples of various radical concentrations measured 
at various temperatures are shown in Figs 2 and 3, and compared with 
the theoretical calculation. Preservation of the spin coherence by 
dynamical decoupling is evidenced by the lengthening of the coherence 
time as the number of control pulses increases. The overall decoherence 
profile is non-exponential, with an initial exponential decay 
followed by a rapid super-exponential drop. To quantify the coherence 
preservation by dynamical decoupling control, we defined the decoherence 
time $T_2$ as the time at which the signal drops to 1/e of its value at t = 0; 
that value was obtained by extrapolation using exponential fitting of the initial decay 
(see Supplementary Information for details). The decoherence times in 
the 0.6 p.p.m. sample at 50 K under UDD1-7 and PDD1-7 control are 
listed in Table 1. UDD clearly outperforms PDD in 
preserving the spin coherence. 
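
The 1/e definition of the decoherence time can be sketched as follows. This is only an illustration under stated assumptions: the signal is synthetic, the number of points used for the initial exponential fit (`n_fit`) is arbitrary, and the authors' actual fitting procedure is described in their Supplementary Information rather than here.

```python
import numpy as np


def one_over_e_time(t, signal, n_fit=6):
    """Return (s0, T2): extrapolated t = 0 signal and the 1/e time.

    s0 comes from a log-linear (exponential) fit to the first n_fit points;
    T2 is found by linear interpolation where the signal crosses s0 / e.
    """
    t = np.asarray(t, dtype=float)
    s = np.asarray(signal, dtype=float)
    # Exponential fit of the initial decay: ln s = ln s0 - t / tau
    slope, intercept = np.polyfit(t[:n_fit], np.log(s[:n_fit]), 1)
    s0 = float(np.exp(intercept))
    target = s0 / np.e
    below = np.nonzero(s <= target)[0]
    if below.size == 0 or below[0] == 0:
        raise ValueError("signal does not cross s0/e inside the measured window")
    k = int(below[0])
    frac = (s[k - 1] - target) / (s[k - 1] - s[k])
    return s0, float(t[k - 1] + frac * (t[k] - t[k - 1]))


# Synthetic echo decay (hypothetical numbers): slow exponential decay times a
# super-exponential drop, sampled only after a 1-unit dead window.
t_demo = np.arange(1.0, 40.0, 0.5)
s_demo = np.exp(-t_demo / 50.0 - (t_demo / 20.0) ** 4)
s0_demo, t2_demo = one_over_e_time(t_demo, s_demo)
print(f"s0 = {s0_demo:.3f}, T2 = {t2_demo:.2f}")
```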

Electron spin decoherence in solids can be caused by any of the 
following mechanisms: (1) direct relaxation due to spontaneous emis- 
sion of photons; (2) spin-lattice relaxation via phonon scattering and 
spin-orbit interaction; (3) hyperfine interaction with nuclear spins; 
(4) coupling between qubits; and (5) coupling to other impurity 
Figure 2 | Electron spin decoherence under UDD and PDD control. a, b, The 
UDD (a) and PDD (b) experiments. The integer associated with each curve 
indicates the number of control pulses. The sample had a radical 
concentration of 0.6 p.p.m. and the temperature was 50 K. c, d, The theoretical 
calculations for a and b, respectively. Insets, semi-logarithmic plots of the 
same data; the variables plotted on the axes are the same as in the main 
panels. 


electron spins. The direct photon emission rate in our resonator was 
estimated at ~10 *s' (see Supplementary Information), so this par- 
ticular mechanism can be disregarded given the timescale considered 
here. We clarified possible contributions of the other mechanisms 
using experiments for samples with various radical concentrations 
at various temperatures and theoretical calculations based on micro- 
scopic models. 

Experiments at various temperatures (Fig. 3a) singled out the 
effect of spin-lattice relaxation. To obtain a sufficiently strong echo 
signal at room temperature, the 32 p.p.m. sample was used. We 
found the spin decoherence time ($T_2$, in μs) to be 7.7 ± 0.4, 
7.7 ± 0.4, 7.7 ± 0.3 and 7.6 ± 0.3 at temperatures (T, in K) of 50, 
100, 200 and 300, respectively. In previous CPMG experiments, a 
thermal activation of spin-lattice relaxation around 150 K was 
observed. In our experiments, any such thermal activation 
was too small to be resolved within the precision of the decoherence time 
($\Delta T_2$). Considering that the spin-lattice relaxation time measured at 
50 K was as long as 940 μs (Supplementary Information), the spin 
decoherence rate attributable to phonon scattering at room temperature 
was below $\delta T_2 / T_2^2 \approx 1/(150\ \mu\mathrm{s})$. Such a small phonon scattering 
effect is probably due to the weak spin-orbit coupling in carbon 
atoms. Thus we expect that the coherence preservation effect of 
dynamical decoupling shown in Fig. 2, which was measured at 50 K 
for a sufficiently strong signal in the low-concentration sample, would 
be similar if measured at room temperature. 
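
The bound on the phonon-scattering rate follows from simple error propagation: an extra decoherence rate Γ shortens the decoherence time by roughly T2² Γ, so a change hidden within the uncertainty δT2 implies Γ < δT2 / T2². The short check below reproduces the quoted figure from the measured values (7.7 μs with an uncertainty of 0.4 μs); the function name is illustrative.

```python
def rate_upper_bound(t2_us, delta_t2_us):
    """Largest extra decoherence rate (per microsecond) hidden within delta_t2."""
    return delta_t2_us / t2_us ** 2


bound = rate_upper_bound(7.7, 0.4)
print(f"rate < {bound:.5f} per us, i.e. about 1/({1.0 / bound:.0f} us)")
```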

Hyperfine interaction with nuclear spins results in a random local 
magnetic field (Overhauser field), 

$\delta B_{\mathrm{hf}} = \gamma_e^{-1} \sum_{i>0,\ \alpha=x,y,z} A_i^{z\alpha} I_i^{\alpha}$, 

where $A_i$ is the hyperfine coupling tensor to the ith nuclear spin $I_i$. 


The thermal distribution of the Overhauser field (that is, inhomogeneous 
broadening) deduced from the resonance line width in the CW 
EPR spectrum (Fig. 1d) is $\Delta B_{\mathrm{hf}} \approx 2.0$ G, consistent with the previous 
measurement. The free-induction decay of the electron spin coherence 
due to the inhomogeneous broadening would have a timescale 
$T_2^{*} = \sqrt{2}\,(\gamma_e \Delta B_{\mathrm{hf}})^{-1} \approx 40$ ns. Such rapid decay was not resolved in our 
pulsed EPR spectrometer. The effect of the static inhomogeneous 
broadening, however, is fully eliminated from the echo signals. 
More relevant is the dynamical fluctuation of the local Overhauser 
field. The dynamical fluctuation $\delta B_{\mathrm{hf}}(t)$ can be driven by pair-wise 
nuclear-spin flip-flops due to the dipole-dipole interaction (see 
Figure 3 | Effects of various decoherence mechanisms in malonic acid 
crystals. a, UDD5 echo signals of the 32 p.p.m. sample at 50, 100, 200 and 
300 K. Values of $T_2$ are shown next to each curve. b, UDD5 echo signals of the 
32, 8 and 0.6 p.p.m. samples at 50 K. The curves with symbols are theoretical 
results. Inset, semi-logarithmic plot of the same data; the variables plotted 
on the axes are the same as in the main panel. c, UDD1-7 echo signals of the 
0.6 p.p.m. sample at 50 K, each divided by the exponential factor which fits 
the initial stage of decoherence. d, UDD1-7 echo signals, calculated 
considering only the dynamical Overhauser field fluctuation driven by 
nuclear dipolar interaction. 


Supplementary Information). The coupled electron-nuclear spins in 
the γ-irradiated malonic acid crystal provide an ideal platform for 
studying the paradigmatic problem of decoherence of a single qubit 
or centre spin embedded in an interacting spin bath. We calculated 
the electron spin decoherence in the nuclear spin baths using a 
quantum theory developed to properly take into account the many-body 
correlations in the bath (Supplementary Information). The 
result is shown in Fig. 3d. 
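
The free-induction decay timescale quoted earlier for the static Overhauser broadening can be checked numerically from the relation "timescale = √2 / (γe ΔB)" with a broadening of 2.0 G. The sketch assumes the free-electron gyromagnetic ratio, γe/2π ≈ 2.8025 MHz/G, which is not stated in the text; the radical's actual g-factor differs slightly.

```python
import math

# Assumed free-electron value (the text does not state the gyromagnetic ratio).
GAMMA_E_RAD_PER_S_PER_G = 2.0 * math.pi * 2.8025e6


def fid_time_ns(broadening_gauss):
    """Free-induction decay time sqrt(2) / (gamma_e * dB), in nanoseconds."""
    return math.sqrt(2.0) / (GAMMA_E_RAD_PER_S_PER_G * broadening_gauss) * 1e9


t2_star_ns = fid_time_ns(2.0)
print(f"FID timescale ~ {t2_star_ns:.1f} ns")
```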

Dynamical Overhauser field fluctuation can also be caused by the 
random displacements of nuclei relative to the electron spin, espe- 
cially in the case of nuclei near the radical. This effect is possible in the 
molecular crystals, which are loosely bound by hydrogen bonds and 
van der Waals forces. For example, at 4.2 K the radical would have 
two stable conformations separated by a low barrier, with the C-H 
bond tilted away from the carbon plane by ±12° (ref. 30). At slightly 
higher temperatures, the radical randomly hops between the two 
conformations and appears to be in the middle. This may result in 
extra dynamical fluctuation of the Overhauser field. Calculation of 
this effect, however, requires detailed knowledge of the electronic 
states, geometrical conformations, and hopping dynamics of radicals 
in the molecular crystal, which is not yet available. 

Another component of the random local field comes from the 
dipolar interaction between a ‘qubit’ spin and other radical electron 
spins. This mechanism is particularly relevant for large-scale 
quantum computing with many qubits, where unwanted interaction 
between qubits is unavoidable. The static inhomogeneous broadening 
$\delta B_e$ due to the electron-electron spin interaction, proportional 
to the radical concentration, ranges roughly from 0.001 to 0.05 G in 
our samples, which is far less than the Overhauser field broadening 
and hence is unimportant in the CW EPR spectrum or the free-induction 
decay. But in contrast to nuclear spins, the ‘bath’ electron 
spins would be flipped together with the ‘qubit’ spin when their 
frequencies fall into the spectrum of the short control π-pulses. As 
a result, $\delta B_e$ was not fully eliminated and became important in the 
echo signals. The effect of the electron-electron spin interaction is 
shown in Fig. 3b. The decoherence time under UDD5 control in the 
32 p.p.m. sample (7.7 ± 0.4 μs) is much shorter than that in the 
Table 1 | Decoherence times of electron spins under UDD and PDD control 

Decoherence time (μs) 

No. of pulses   UDD experiment*   UDD theory†   PDD experiment‡   PDD theory§ 
1               6.2 ± 0.1         7.56          6.2 ± 0.1         7.56 
2               13.3 ± 0.4        15.7          13.3 ± 0.4        15.7 
3               18.5 ± 0.5        20.1          13.5 ± 0.4        20.9 
4               21.4 ± 0.4        24.5          17.6 ± 1.2        22.0 
5               24.3 ± 0.9        25.7          18.8 ± 0.9        24.8 
6               30.2 ± 1.5        28.1          20.2 ± 1.0        25.5 
7               27.1 ± 0.9        29.0          21.9 ± 2.5        27.0 

* For data shown in Fig. 2a. 
† For data shown in Fig. 2c. 
‡ For data shown in Fig. 2b. 
§ For data shown in Fig. 2d. 


0.6 p.p.m. sample (24.3 ± 0.9 μs). This indicates that the electron-electron 
spin interaction dominates the decoherence in the high-concentration 
sample. We confirmed this conclusion by calculation, 
considering the inhomogeneous broadening due to the electron spin 
interaction and the finite duration of the control pulses (Fig. 3b, see 
Supplementary Information for methods). Coupling to impurities 
should have little effect, since those impurities, if any, would be far 
off-resonant and hence any inhomogeneous broadening on their 
account would be totally removed by spin echoes. 

All the experimental data in Figs 2a, 2b and 3b are in good agreement 
with the calculations (Figs 2c, 2d and 3b), which took into account the 
static inhomogeneous broadening due to electron-electron spin interaction 
and the dynamical Overhauser field fluctuation. The dynamical 
field fluctuation caused by random displacement of nuclei was 
described by a phenomenological decay factor exp(−t/T3), with T3 
being the only fitting parameter, fixed at the same value (30 μs) in all the 
calculations. The decoherence due to random conformation-hopping 
may account for the saturation of the spin coherence time 
at about 30 μs in the 0.6 p.p.m. sample as the UDD goes beyond the 
sixth order (Fig. 2a). In samples with higher radical concentrations, the 
saturated coherence time is much shorter (for example, about 7.7 μs in 
the 32 p.p.m. sample). This is because dynamical decoupling cannot 
decouple the qubits from the other electron spins in near-resonance. If 
we consider only the dynamical Overhauser field fluctuation driven by 
nuclear dipolar interaction, the calculation (Fig. 3d) shows that the 
decoherence under dynamical decoupling control presents a plateau 
with width increasing roughly linearly with the order of dynamical 
decoupling, and under UDD7 the coherence persists beyond 50 μs. 
These features are indeed in agreement with the experimental data 
excluding the initial exponential decay (Fig. 3c). In the plateau regime, 
the decoherence would be dominated by the other mechanisms, such 
as the electron-electron interaction and the random conformation-hopping. 
This explains the initial exponential decay and the overall 
non-exponential profile seen in Figs 2 and 3. The small but noticeable 
discrepancy between the experimental and theoretical results may be 
ascribed to the lack of an accurate model of the hyperfine interaction, a 
precise estimation of radical concentrations, and a full understanding 
of the molecule conformation dynamics. Further study of the microscopic 
decoherence mechanisms in the solids is desirable. 


METHODS SUMMARY 


Malonic acid single crystals were grown from saturated aqueous solutions by 
slow evaporation at 5 °C, γ-irradiated at room temperature with various doses, 
and aged at 67 °C for 15 h to eliminate unstable radicals. The crystal structure was 
determined by X-ray diffraction. The radical concentrations were estimated 
from the areas of CW EPR resonances compared with a standard sample. 

In the dynamical decoupling experiments, the spin precession was triggered by 
a π/2-rotation using a 28-ns pulse. The spins were flipped by 56-ns pulses in the 
UDD or PDD timing. The area of the echo signal was measured. To avoid the 
interference of unwanted echoes in the multi-pulse experiments, phase cycling 
was applied. The magnetic field and the pulse frequency were 3,477 G and 
9.722 GHz for the 32 p.p.m. sample, 3,454 G and 9.664 GHz for the 8 p.p.m. 
sample, and 3,447 G and 9.645 GHz for the 0.6 p.p.m. sample, respectively. 
The signals near t = 0 were not measured owing to the dead window of the 
spectrometer but were obtained by extrapolation using exponential fitting of an initial 
range of data. The standard deviations of the fitting were added to the error bars 
of the decoherence times. 

The photon emission rate was estimated considering the Purcell enhancement 
factor of our cavity. The nuclear spin bath dynamics were calculated using the 
cluster correlation expansion, including up to four-spin clusters. Including the 
nearest 500 nuclear spins around a radical was sufficient to produce converged 
results. The decoherence due to interaction with a ‘bath’ electron spin was 
calculated considering the rotation under the finite-duration pulses; the 
contributions from different electron spins were summed, and the results were 
averaged over position configurations. 

A dearth of intermediate melts at subduction zone 
volcanoes and the petrogenesis of arc andesites 


Olivier Reubi & Jon Blundy 


Andesites represent a large proportion of the magmas erupted at 
continental arc volcanoes and are regarded as a major component in 
the formation of continental crust. Andesite petrogenesis is therefore 
fundamental in terms of both volcanic hazard and differentiation 
of the Earth. Andesites typically contain a significant 
proportion of crystals showing disequilibrium petrographic characteristics 
indicative of mixing or mingling between silicic and mafic 
magmas, which fuels a long-standing debate regarding the significance 
of these processes in andesite petrogenesis and ultimately 
questions the abundance of true liquids with andesitic composition. 
Central to this debate is the distinction between liquids (or melts) 
and magmas, mixtures of liquids with crystals, which may or may 
not be co-genetic. With this distinction comes the realization that 
bulk-rock chemical analyses of petrologically complex andesites can 
lead to a blurred picture of the fundamental processes behind arc 
magmatism. Here we present an alternative view of andesite petrogenesis, 
based on a review of quenched glassy melt inclusions 
trapped in phenocrysts, whole-rock chemistry, and high-pressure 
and high-temperature experiments. We argue that true liquids of 
intermediate composition (59 to 66 wt% SiO2) are far less common 
in the sub-volcanic reservoirs of arc volcanoes than is suggested by 
the abundance of erupted magma within this compositional range. 
Effective mingling within upper crustal magmatic reservoirs 
obscures a compositional bimodality of melts ascending from the 
lower crust, and masks the fundamental role of silicic melts (≥66 
wt% SiO2) beneath intermediate arc volcanoes. This alternative view 
resolves several puzzling aspects of arc volcanism and provides 
important clues to the integration of plutonic and volcanic records. 

Quenched glassy melt inclusions trapped in phenocrysts provide 
snapshots of liquid (melt) evolution during crystallization. As such, 
they represent the only unequivocal information on the composition 
of true volcanic liquids in petrologically complex magmas. A compilation 
of published melt inclusion data (n = 2,582) covering more 
than 85 arc volcanoes extracted from the GEOROC global compilation 
shows a range of SiO2 content from 40 to 83 wt% (calculated on 
an H2O-free basis). Although the range of SiO2 content is continuous, 
the distribution is strongly bimodal, with major peaks around 54 
and 76 wt% (Fig. 1a). There is a marked paucity of melt inclusions 
with intermediate (andesite) composition, particularly between 59 
and 66 wt% SiO2, despite the abundance of volcanic rocks of this 
composition in arcs. Minima at intermediate compositions are also 
observed for MgO, CaO, FeO and K2O. A similar conclusion has been 
drawn from a compilation of melt inclusions for igneous rocks from 
all tectonic settings. 

Melt inclusions show a good correlation with the bulk composition 
of their host rock, for host compositions <55 and >71 wt% 
SiO2. The correlation persists for rocks up to 59 wt% SiO2, although 
these samples occasionally also contain melt inclusions significantly 


more evolved than their host rock. The vast majority of intermediate 
rocks contain melt inclusions that are rhyolitic in composition, irre- 
spective of host-rock composition (Fig. 1c, d). The striking feature of 
Fig. 1 is that the pronounced intermediate composition peak observed 
in the bulk rock record is displaced towards significantly more silicic 
compositions in the melt inclusion record. This shift is independent of 
the mineral phase in which the melt inclusion is hosted (Fig. 1d) and 
cannot, therefore, result from post-entrapment crystallization of the 
host mineral or other systematic biases in the melt inclusion record 
(Methods). 

It is widely held that melt inclusions in arc magmas record volatile-saturated 
conditions. H2O and CO2 contents in melt inclusions 
indicate typical pressures of entrapment below 550 and 350 MPa for 
mafic and silicic melts, respectively, reflecting crystallization in the 
upper crust. Melt inclusions from individual volcanoes show steep to 
nearly sub-vertical negative correlations between H2O and SiO2 
(Fig. 2), due to varying degrees of decompression-driven crystallization. 
This process does not, however, appear to drive appreciable 
chemical differentiation of magmas owing to the inefficient separation 
of crystal and melt phases. Melt inclusions suggest that low-pressure 
crystallization of mafic melts can produce residual melts with 
≤60 wt% SiO2 but only rarely extends to high-silica andesite (Figs 1c 
and 2). This limit also corresponds to the most evolved composition of 
aphyric (crystal-free) andesites, and to the most evolved 
compositions erupted from mafic cones and shield volcanoes in arcs. 
The melt inclusion record suggests that this threshold corresponds to 
the upper limit of vapour-saturated differentiation of mafic magmas in 
the upper crust. Residual melts in crystal cumulates or groundmasses 
certainly evolve beyond 60 wt% SiO2, but the high crystallinity 
involved precludes efficient separation of melt from crystals. In 
summary, the melt inclusion record indicates that melts reaching 
the upper crust, where they crystallize mostly in response to degassing, 
are compositionally bimodal with a gap from 60 to ≥65 wt% SiO2. It is 
not exceptional for this gap to extend to 70 wt% SiO2. Low-pressure 
crystallization of mafic melts partially fills the gap, but the proportion 
of unequivocal melts with 60-66 wt% SiO2 remains remarkably low in 
shallow magmatic systems. 

Experimental crystallization of hydrous basaltic melts over a range 
of crustal pressures elucidates the possible liquid lines of descent 
during crustal differentiation. Experimentally produced melts describe 
curved fractionation trends that culminate in the compositions of 
silicic magmas and melt inclusions (Fig. 3), demonstrating that fractionation 
of mafic magmas at some level within the crust is capable of 
producing these silicic liquids. A further contribution may come from 
crustal contamination, as often required by whole-rock isotopic 
data. Both mafic and silicic melt inclusions show a good match 
to experimental liquids, suggesting that melt inclusions preserve true 
melt compositions produced by crystallization of hydrous basaltic 
Figure 1 | SiO2 contents (H2O-free) of melt inclusions in arc magmas. 

a, Global compilation of arc magma melt inclusions, and bulk rock 
compilation for North, Central and South America (GEOROC database). 
Plutonic curve shown for comparison is a compilation of data for the 
Lachlan fold belt, Australia (B. W. Chappell, personal communication), 
Adamello batholith, Italy (P. Ulmer, personal communication) and Sierra 
Nevada batholith, USA (NAVDAT compilation for Cretaceous plutonic 
rocks). b, Melt inclusion compositions in arc rocks, classified by host mineral 
phase. c, Compositions of melt inclusions versus host rock compositions for 
arc volcanoes (subset of the data in a). d, Distribution of melt inclusions and 
their host rock compositions for the data subset in c. 
Figure 2 | H2O-SiO2 contents of melt inclusions from several arc 
volcanoes. SiO2 contents are calculated on an H2O-free basis, and chemical 
analyses are normalized to a total of 100%; data are taken from refs 4, 28-31, 
and Smith, V. C. et al., manuscript in preparation. Inset, schematic trends 
anticipated for different crystallization mechanisms. Trends for individual 
volcanoes are indicative of various degrees of decompression-crystallization 
in vapour-saturated melts, and indicate that relatively limited differentiation 
occurs in the upper crust. 


melts. However, only a limited number of intermediate melt inclusions 
correspond to experimental melts, endorsing a view that intermediate 
melts often result from magma mixing. This view is corroborated 
by the fact that most suites of intermediate rocks describe linear 
chemical trends, rather than the curved trends generated experiment- 
ally. It is striking that the mismatch between the experimental liquid 
lines of descent and intermediate bulk rocks exactly corresponds to the 
compositional gap recorded by the melt inclusions, leaving little doubt 
that the vast majority of intermediate arc magmas do not represent 
melts produced by differentiation of mafic magmas. 

The melt inclusion record (Fig. 1c, d) shows that andesitic volcanoes 
contain almost exclusively dacitic to rhyolitic liquids. The 
majority of intermediate rocks lack mafic melt inclusions, which 
raises questions as to what drives their bulk composition back to 
andesite. The most commonly invoked process is magma mixing, 
for which there is abundant petrological evidence in andesitic rocks. 
However, the paucity of andesitic melt inclusions suggests that mixing 
itself does not produce large volumes of hybrid andesitic melts. At 
Volcan de Colima (Mexico), for example, there are strong indications 
that the linear andesitic bulk-rock trends are controlled by 
incorporation of gabbroic fragments into silicic melts, as represented 
by melt inclusions. The gabbroic fragments are themselves a 
product of crystallization of ancestral mafic magmas beneath the 
volcano. The presence of disequilibrium mafic crystals and plutonic 
nodules is a well-known feature of intermediate arc magmas, suggesting 
that mechanical incorporation of mafic rock fragments into 
silicic melts may be a widespread process. Incorporation could result 
either from assimilation of mafic plutonic roots by ascending silicic 
melts or from incomplete separation of restitic fragments and residual 
silicic melts during extraction from their source regions. Given the 
abundance of plutonic nodules at many volcanic arcs, it is axiomatic 
that most ascending magmas encounter and interact with ancestral 
crystalline residues before eruption. 
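
The link between mechanical incorporation and linear bulk-rock trends can be illustrated with a two-component mass balance. The sketch below is purely illustrative: the end-member compositions are hypothetical round numbers, not data from the paper, and the function names are ours.

```python
def mix(fraction_mafic, mafic, silicic):
    """Bulk composition (wt%) of a mafic-fragment plus silicic-melt mixture."""
    return {
        oxide: fraction_mafic * mafic[oxide] + (1.0 - fraction_mafic) * silicic[oxide]
        for oxide in mafic
    }


def mafic_fraction(bulk_sio2, mafic_sio2, silicic_sio2):
    """Mass fraction of the mafic component needed to reach a bulk SiO2 value."""
    return (silicic_sio2 - bulk_sio2) / (silicic_sio2 - mafic_sio2)


# Hypothetical end members (wt%): gabbroic fragments and a rhyolitic melt.
gabbro = {"SiO2": 48.0, "MgO": 9.0}
rhyolite = {"SiO2": 72.0, "MgO": 0.5}

f = mafic_fraction(60.0, gabbro["SiO2"], rhyolite["SiO2"])
andesite = mix(f, gabbro, rhyolite)
print(f"mafic fraction = {f:.2f}, bulk = {andesite}")
```

Because every oxide is a linear function of the same mixing fraction, any pair of oxides plots on a straight line between the end members, in contrast to the curved trends expected from fractional crystallization.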

Considering that melt inclusions record upper crustal pressures 
(≤550 MPa), the compositional gap must be produced during an 
early stage of magmatic evolution, in the mid to deep crust, implying 
that melt compositional diversity is acquired at depth, as suggested in 
refs 16 and 21. In view of the good agreement between the experi- 
mental liquid lines of descent of hydrous basalts and silicic melt 
inclusions (Fig. 3), the compositional gap is probably related to the 
formation and/or extraction of silicic melts during crystallization of 
basalts and/or melting of mafic source rocks in the mid to deep crust. 
Figure 3 | Comparison of chemical variation in melt inclusions, volcanic 
bulk rocks and experimental melts. The represented liquid lines of descent 
(curves a-e in top panels, shown in panels below) are based on experimental 
studies of hydrous basalts over a range of crustal pressures (0.1-1.2 GPa). 
Bulk rock data (symbols) are from a GEOROC compilation for North, 
Central and South America. Melt inclusions as in Fig. 1. The mismatch 


Melts produced experimentally by crystallization of hydrous basalts 
span a continuous range of composition without any obvious gap 
(Fig. 3). However, over a wide range of crustal pressures, the slope of 
the liquidus in temperature-composition space shallows markedly 
between 58-60 and 64-68 wt% SiO2, which correlates well with the 
gap recorded by the melt inclusions (Fig. 4). As originally proposed, 
large compositional changes over small temperature intervals significantly 
reduce the likelihood of extracting compositions within these 
intervals compared to compositions corresponding to steeper portions 
of the liquidus at higher and lower SiO2. We propose that the 
andesitic gap shown by the melt inclusion record is a consequence of 
the phase relations of hydrous basalts in the mid to lower crust. Both 
evolved and basaltic compositions ascend and crystallize in the shallow 
crust: the production of the vast majority of andesitic rocks is 
simply a consequence of the blending of these components in upper 
crustal reservoirs. 

Recognizing the existence of a gap in melt composition and the 
mixed nature of most erupted andesites solves several puzzling 
aspects of arc magmatism. A review (ref. 8) of Quaternary magmatism in 
the continental Cascades arc (USA) highlights the fact that the vast 
observed between the experimental liquid lines of descent and many 
intermediate bulk rocks, which are displaced to higher MgO relative to 
experimental liquids, coincides with the compositional gap recorded by the 
melt inclusions (see Fig. 1), strongly suggesting rarity of true intermediate 
melts in arc settings. 


majority of intermediate magmas (57-68 wt% SiO2) were erupted 
from 22 major stratovolcanoes, whereas more than 2,000 single vents 
dispersed throughout the arc produced mafic magmas (<60 wt% 
SiO2). That the predominant mafic cones and shield volcanoes do 
not grade into high-silica andesites is difficult to explain if silicic melts 
were produced by low-pressure basalt crystallization, but is consistent 
with a melt inclusion record showing that, in most cases, vapour-saturated 
crystallization of basaltic melt is limited to producing rocks 
with less than 60 wt% SiO2. The review in ref. 8 also shows that, in 
contrast to intermediate magmas, whose occurrence is restricted to 
major stratovolcanoes, silicic magmas were erupted from independent 
peripheral vents and dome fields. With the exception of the 22 
major stratovolcanoes, the Cascades therefore shows evidence for a 
clear bimodality. Recent studies of oceanic arc bulk rocks have also 
shown distinct bimodality, with minima at high-silica andesite. 
This concurs with the melt inclusion and experimental records, indicating 
that intermediate liquids are not typically primary products of 
the subduction factory. 

The link between plutonic and volcanic rocks is the key to under- 
standing the formation and evolution of magmatic systems. Recent 


©2009 Macmillan Publishers Limited. All rights reserved 


LETTERS 


[Figure 4 plot. Horizontal axis: SiO2 in melt (wt%), 54 to 76. Vertical axis: temperature, ticks from 800 to 1,100 (axis label not recoverable). Key: ref. 11, 51.3 wt% SiO2, 700 MPa, QFM; ref. 11, 51.3 wt% SiO2, 700 MPa (remainder of label lost); ref. 11, 53.9 wt% SiO2, 700 MPa, QFM; ref. 9, 56.3 wt% SiO2 at 100, 150 and 200 MPa. Annotations: Volcan de Colima; Mount St Helens; minimum gap recorded by melt inclusions.] 


Figure 4 | Experimental liquid lines of descent in 
temperature-composition space at a range of crustal pressures 
(100-700 MPa). Grey areas represent the compositional gap observed in 
the melt inclusion record. Volcan de Colima marks the minimum extension 
of this gap, whereas Mount St Helens represents the common width of the 
gap at arc volcanoes (see Fig. 1b). The SiO2 contents in the key are starting 
melt compositions used in experiments. 


studies have presented several lines of evidence that plutonic bulk 
rock compositions are in large part representative of solidified melts 
rather than crystal cumulates. Good correlations between melt 
inclusions, experimental liquid lines of descent and plutonic rock 
compositions support this hypothesis for mafic and silicic compositions 
(Fig. 5). However, intermediate plutonic rocks, like their volcanic 
counterparts, appear to be products of mingling and mixing. 
Plutons commonly show clear field evidence for bimodality of 
incoming melts and subsequent mechanical and/or chemical interaction 
between these components, supporting an upper crustal 
mixing/mingling origin for intermediate plutonic rocks. The most 
striking discrepancies between the volcanic and plutonic records are 
Figure 5 | Chemical variation in melt inclusions, plutonic bulk rocks and 
experimental melts. Diamonds are plutonic bulk rock compositions (data 
set as in Fig. 1a), small dots represent the GEOROC melt inclusion 
compilation for arc volcanoes, curves are experimental liquid lines of 
descent as in Fig. 3. Good correlation between bulk rock and melt inclusion 
compositions suggests that mafic and silicic plutonic rocks essentially 
represent melts. As with the volcanic rocks (Fig. 3), intermediate plutonic 
rocks show a mismatch with experimental melts, which corresponds to the 
gap recorded by melt inclusions, demonstrating that true intermediate melts 
are rare in plutons. 
the paucity of andesite compositions and the large amount of silicic 
magmas in plutons (Figs 1a, 5). These discrepancies may be reconciled 
if we consider the constraints provided by the melt inclusion record, 
namely that intermediate magmas form by mingling/mixing processes 
within bimodal magmatic systems and large volumes of dacitic to 
rhyolitic melts occur beneath andesitic volcanoes. In addition, plutons 
often lack the types of internal compositional zoning that would be 
consistent with in situ differentiation, suggesting, once again, that 
upper crustal magmatic systems are the site of extensive crystallization 
and blending between bimodal melts ascending from the lower crust 
rather than fractionation-driven differentiation. 


METHODS SUMMARY 


Melt inclusion data for arc volcanoes were extracted from the GEOROC 
melt inclusion global compilation (precompiled file available at 
http://georoc.mpch-mainz.gwdg.de/georoc/). Bulk rock compositions shown in figures were 
also obtained from GEOROC precompiled files. Cretaceous plutonic bulk rock 
compositions for the Sierra Nevada batholith were extracted from the NAVDAT 
compilation (http://www.navdat.org/). All data presented are calculated on an 
H2O-free basis. 
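
Recalculating an analysis on an H2O-free basis amounts to dropping H2O and renormalizing the remaining oxides to 100 wt%. The sketch below shows that step; the analysis values are hypothetical, and the exact set of oxides the authors normalized over is not stated here.

```python
def normalize_anhydrous(oxides_wt):
    """Drop H2O and renormalize the remaining oxides to a total of 100 wt%."""
    dry = {name: value for name, value in oxides_wt.items() if name != "H2O"}
    total = sum(dry.values())
    if total <= 0:
        raise ValueError("analysis has no non-volatile oxides")
    return {name: 100.0 * value / total for name, value in dry.items()}


# Hypothetical hydrous melt inclusion analysis (wt%).
analysis = {
    "SiO2": 70.0, "Al2O3": 13.0, "FeO": 2.0, "CaO": 2.0,
    "Na2O": 4.0, "K2O": 3.0, "H2O": 4.0,
}
dry_analysis = normalize_anhydrous(analysis)
print({name: round(value, 2) for name, value in dry_analysis.items()})
```

With 4 wt% H2O, a measured 70.0 wt% SiO2 becomes about 74.5 wt% on a dry basis, which shows why the choice of basis matters when comparing inclusions with bulk rocks.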


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 


Modification of melt inclusions after trapping, either through crystallization of 
the host phase onto inclusion walls or by growth of daughter crystals, is an 
enduring concern in the interpretation of melt inclusion data (see, for example, 
ref. 32). Typically, the compositions of mafic melt inclusions in olivine are 
corrected for post-entrapment crystallization of the host phase and diffusive loss 
of Fe (refs 33, 34) whereas silicic melt inclusions are uncorrected, potentially 
producing a bias in the data set presented. However, the extent of the correction 
applied to mafic melt inclusions is small (generally <2 wt% SiO2) and cannot 
generate a compositional gap. Detailed studies of silicic melt inclusions in 
andesites from Volcan de Colima (Mexico), Mount St Helens (USA) and Shiveluch 
(Russia) have shown that melt inclusions show the same range of composition 
irrespective of their host mineral. Post-entrapment re-equilibration of melt 
inclusions would result in compositional evolution controlled by crystallization 
of the host or diffusion through the host. In both cases, the melt inclusion will 
have a chemistry that is controlled by the crystal—melt partition coefficients and/ 
or relative diffusivity in the host crystal. Compositions modified by post- 
entrapment processes are, of necessity, specific to their host mineral. Thus melt 
inclusions in plagioclase, pyroxenes, amphibole and quartz should form distinct 
populations; this is demonstrably not the case, as can be seen from Figs 1 and 3. 
For example, if post-entrapment crystallization played a significant role, then we 
would see a difference in SiO2 contents of melts trapped in amphibole (a 
low-SiO2 phase) versus orthopyroxene (a high-SiO2 phase). This is not observed, and 
the shift towards more silicic compositions shown by the melt inclusions com- 
pared to host rock compositions is observed for plagioclase, orthopyroxene and 
amphibole (Fig. 1) and therefore cannot result from specific entrapment or 
preservation mechanisms. 

An additional concern is the ability of melt inclusions to record the evolution 
of the melt throughout the course of crystallization. Studies of andesitic vol- 
canoes have shown that bulk rocks and melt inclusions form oblique chemical 
trends that intersect at, or close to, the composition of the least evolved melt 
inclusions. This clearly demonstrates that at these volcanoes, at least, the 
melt inclusions effectively record the composition of the melt from close to the 
onset of low-pressure crystallization (that is, to within 1–2 wt% SiO2). This may 
sound surprising, considering that crystals need to be growing from melt before 
they can trap melt inclusions. However, the presence of older inherited crystals 
(antecrysts) is a well-known feature of intermediate rocks, and it is likely that at 
least some melt inclusions become trapped within antecrysts incorporated in 
ascending melts. Indeed, resorption of antecrysts in superheated hydrous melts 
ascending through the crust, followed by the growth of low-pressure overgrowth 
rims as the melts reach their H2O-saturated liquidus, is an efficient 
melt-inclusion-forming mechanism. 

It is often assumed that melt inclusions out of chemical equilibrium with their 
host mineral are unrepresentative of the surrounding melt and form either from 
grain boundary processes or post-entrapment re-equilibration. However, resorption 
is one of several possible mechanisms of melt inclusion formation and the 
existence of melt inclusions out of equilibrium with the composition of the host 
adjacent to them is to be expected, as discussed above. Detailed compositional 
mapping of the host crystal zoning pattern and consideration of three-dimensional 
effects are often essential to establish the melt inclusion entrapment mechanism. 
Silicic melt inclusions in resorbed An-rich plagioclase are one such example. 
Careful evaluation of the melt inclusion chemistry, and comparison between melt 
inclusions in different host minerals and with host rock and groundmass 
compositions, are essential to establish representativeness. Disequilibrium between 
melt inclusions and their host crystal does not in itself signify that the 
composition of the melt inclusion is unrepresentative of the melt; rather, it is 
an indication of the mechanism of formation and entrapment. 

Melt inclusions with compositions clearly distinct from host rock and groundmass 
glass compositions have been reported in arc magmas. These ‘exotic’ 
melt inclusions are interpreted as reflecting localized, grain-scale dissolution 
processes in crystal cumulates and cannot be taken as representative of the 
composition of the crystallizing melt. However, because the compositions of 
the vast majority of melt inclusions are consistent with bulk rock, experimental 
melt and/or groundmass glass compositions in both mafic and silicic magmas, it 
appears that exotic melt inclusions are relatively rare and there is no doubt that 
most melt inclusions effectively record the evolution of the melts feeding arc 
volcanoes. 

We conclude that neither post-entrapment re-equilibration of melt inclu- 
sions, crystallization of the host phase nor bias in the melt inclusion record 
can explain the observed compositional gap and the occurrence of a silicic peak. 
Consequently these features represent primary characteristics of arc magmatism. 
Visual but not trigeminal mediation of magnetic 
compass information in a migratory bird 


Manuela Zapka, Dominik Heyers, Christine M. Hein, Svenja Engels, Nils-Lasse Schneider, Jorg Hans, 
Simon Weiler, David Dreyer, Dmitry Kishkinev, J. Martin Wild & Henrik Mouritsen 


Magnetic compass information has a key role in bird orientation, 
but the physiological mechanisms enabling birds to sense the 
Earth’s magnetic field remain one of the unresolved mysteries in 
biology. Two biophysical mechanisms have become established 
as the most promising magnetodetection candidates. The 
iron-mineral-based hypothesis suggests that magnetic information is 
detected by magnetoreceptors in the upper beak and transmitted 
through the ophthalmic branch of the trigeminal nerve to the 
brain. The light-dependent hypothesis suggests that magnetic 
field direction is sensed by radical pair-forming photopigments 
in the eyes and that this visual signal is processed in cluster N, a 
specialized, night-time active, light-processing forebrain region. 
Here we report that European robins with bilateral lesions of 
cluster N are unable to show oriented magnetic-compass-guided 
behaviour but are able to perform sun compass and star compass 
orientation behaviour. In contrast, bilateral section of the ophthal- 
mic branch of the trigeminal nerve in European robins did not 
influence the birds’ ability to use their magnetic compass for orienta- 
tion. These data show that cluster N is required for magnetic compass 
orientation in this species and indicate that it may be specifically 
involved in processing of magnetic compass information. Further- 
more, the data strongly suggest that a vision-mediated mechanism 
underlies the magnetic compass in this migratory songbird, and that 
the putative iron-mineral-based receptors in the upper beak 
connected to the brain by the trigeminal nerve are neither necessary 
nor sufficient for magnetic compass orientation in European robins. 

Thirty-six European robins (Erithacus rubecula) were caught 
within 200 m of the testing site and their spontaneous migratory 
orientation was tested in modified Emlen funnels inside wooden 
huts in the natural magnetic field (NMF) and in a magnetic field with 
geomagnetic north turned horizontally 120° anticlockwise (CMF). 
After these control tests confirmed well-oriented magnetic compass 
behaviour, one of the following surgeries was performed: bilateral 
section (N = 7; for details, see Methods) of the ophthalmic branch of 
the trigeminal nerve (V1); trigeminal sham section (N = 6; the same 
treatment except that V1 was not sectioned); chemical lesion of 
cluster N (N = 13; bilateral, focal injections of ibotenic acid were 
made into cluster N); or sham lesion of cluster N (N = 10; the same 
treatment but without injection of ibotenic acid). The surgeries were 
performed by two of us (J.M.W. and D.H.) without the others know- 
ing which bird underwent which surgery. After the surgery and a 
recovery period of at least one week, the birds were retested under 
the same magnetic conditions as in the control tests (NMF and CMF) 
during spring migration. Subsamples of the cluster-N-lesioned or 
sham-lesioned birds were also tested in a natural magnetic field with 
an inverted vertical component (IMF). All orientation results were 
evaluated independently by two or three individuals who did not 
know which kind of surgery (real or sham) or which magnetic 
condition the birds had experienced. 

The orientation results showed that in European robins the ophthalmic 
branch of the trigeminal nerve is not necessary (Fig. 1) for magnetic 
compass orientation, whereas cluster N is necessary (Fig. 2a–f). 
The sham-sectioned birds oriented north in the geomagnetic field 
(α = 10° ± 20° (mean vector orientation angle; 95% confidence 
interval), r = 0.95 (mean vector length), N = 6, P < 0.002; Fig. 1a). When 
geomagnetic north was turned to 240°, the same birds oriented 
west-southwest (α = 245° ± 26°, r = 0.90, N = 6, P < 0.005; Fig. 1b). The 
trigeminal-sectioned birds also clearly oriented north in the 
geomagnetic field (α = 354° ± 20°, r = 0.93, N = 7, P < 0.001; Fig. 1d) and 
west-southwest when geomagnetic north was turned to 240° 
(α = 264° ± 34°, r = 0.77, N = 7, P < 0.01; Fig. 1e). The 95% confidence 
intervals in Fig. 1a, b and Fig. 1d, e do not overlap, so we conclude that 
both groups significantly changed their orientation in response to the 
turned magnetic field. Furthermore, the mean orientation of both 
groups under the CMF condition was not significantly different from 
that expected to result from a magnetic field turn of −120°, but was 
significantly different from the same birds’ orientation under the NMF 
condition (the 95% confidence intervals of Fig. 1c, f include 240° but do 
not include 0°). 

Figure 1 | Bilateral sectioning of the ophthalmic branch of the trigeminal 
nerve does not affect magnetic compass orientation in European robins. 
The drawings show the approximate locations of the three branches of the 
trigeminal nerve. The ophthalmic branch (V1) is shown in bold. The crosses 
indicate the approximate locations at which the nerve was sectioned and a 
piece removed. a–c, Magnetic orientation of six sham-sectioned birds (MN, 
magnetic north). d–f, Magnetic orientation of seven trigeminal-sectioned 
birds. Each filled circle at the periphery indicates the mean orientation of an 
individual bird based on nine tests under the given magnetic condition. 
c and f compare the orientation of each bird in the turned magnetic field 
(CMF) with the same bird’s orientation in the natural magnetic field (NMF, 
standardized to 0°). Arrows indicate the group mean vectors. The longer 
the group mean vector, the more consistent the orientation choices 
between individuals. Inner and outer dashed circles indicate the radii of the 
group mean vectors needed for directional significance according to the 
Rayleigh test (inner, P < 0.05; outer, P < 0.01). Radial lines flanking the 
group mean vector mark the 95% confidence interval for the group mean 
direction. 

The sham-lesioned birds oriented north-northeast (α = 25° ± 21°, 
r = 0.90, N = 10, P < 0.001; Fig. 2a) in the geomagnetic field. When 
geomagnetic north was turned to 240°, the same birds oriented 
southwest (α = 219° ± 26°, r = 0.83, N = 10, P < 0.001; Fig. 2b), 
and when the vertical component of the geomagnetic field was 
inverted, the same birds oriented south-southeast (α = 154° ± 37°, 
r = 0.69, N = 10, P < 0.01; Fig. 2c). None of these 95% confidence 
intervals overlap, so we conclude that the birds oriented significantly 
differently under the three magnetic conditions. Judging from the 
95% confidence intervals, the orientations of the CMF and IMF 
groups were not significantly different from those expected to result 
from magnetic field turns of −120° and 180°. 
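
The interval comparisons used in this paragraph can be illustrated with a minimal Python sketch. It treats each reported "mean ± half-width" as an arc on the compass circle and asks whether two arcs overlap or whether an arc contains a given bearing. The function names are hypothetical, and the paper does not describe how its comparisons were computed, so this is an illustration of the reasoning only. 

```python
def circ_dist(a, b):
    """Smallest absolute difference between two bearings, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def ci_contains(mean, half_width, bearing):
    """True if the arc mean +/- half_width includes the given bearing."""
    return circ_dist(mean, bearing) <= half_width


def ci_overlap(mean1, half1, mean2, half2):
    """True if two arcs (mean +/- half-width) share at least one bearing."""
    return circ_dist(mean1, mean2) <= half1 + half2


# Group means and 95% half-widths quoted in the text for sham-lesioned birds.
sham = {"NMF": (25.0, 21.0), "CMF": (219.0, 26.0), "IMF": (154.0, 37.0)}

for first, second in [("NMF", "CMF"), ("NMF", "IMF"), ("CMF", "IMF")]:
    overlap = ci_overlap(*sham[first], *sham[second])
    print(first, second, "overlap" if overlap else "no overlap")
```

With the values quoted above, none of the three pairs overlaps, in line with the statement in the text. 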

In contrast to the well-oriented, sham-lesioned birds, the birds with a 
chemical lesion of cluster N oriented randomly under all three 
magnetic field conditions (NMF: α = 355°, r = 0.33, N = 13, P = 0.25; 
CMF: α = 318°, r = 0.40, N = 13, P = 0.13; IMF: α = 120°, r = 0.07, 
N = 7, P > 0.90; Fig. 2d–f). Furthermore, the consistency of the birds’ 
directional choices between tests was significantly poorer in the 
cluster-N-lesioned birds than in the sham-lesioned birds (comparing the r 
values for the individual mean directions in NMF: t-test, N_lesion = 13, 
N_sham = 10, t = 3.160, P < 0.01). Consequently, birds with lesions of 
cluster N cannot perform magnetic compass orientation in an 
orientation cage. The fact that the cluster-N-lesioned birds, which possessed 
intact trigeminal nerves, did not orient indicates that information 
transmitted through the ophthalmic branch of the trigeminal nerve 
is not sufficient for magnetic compass orientation in European robins. 
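
As a rough check on how the reported r and N values relate to the quoted P values, the sketch below evaluates the Rayleigh test of uniformity from the mean vector length alone. It uses a widely cited series approximation for the P value; the paper does not say which tables or approximation were used, so the formula choice and the function name are assumptions. 

```python
import math


def rayleigh_p(r, n):
    """Approximate Rayleigh-test P value from mean vector length r and sample size n."""
    z = n * r * r  # Rayleigh statistic Z = n * r^2
    correction = (
        1.0
        + (2.0 * z - z * z) / (4.0 * n)
        - (24.0 * z - 132.0 * z**2 + 76.0 * z**3 - 9.0 * z**4) / (288.0 * n * n)
    )
    p = math.exp(-z) * correction
    return min(max(p, 0.0), 1.0)  # clamp to a valid probability


# Values quoted in the text for the cluster-N-lesioned birds.
for label, r, n in [("NMF", 0.33, 13), ("CMF", 0.40, 13), ("IMF", 0.07, 7)]:
    print(label, round(rayleigh_p(r, n), 3))
```

Because the published r values are rounded to two decimals, the recomputed P values can differ slightly from those reported. 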

After completion of the experiments, all birds that had undergone 
real (non-sham) surgery were killed for anatomical/histological 
analysis. In all trigeminal-sectioned specimens, the nerves were 
found not to have rejoined. Brains from cluster-N-lesioned birds 
were sectioned and stained for anti-human neuronal protein 
(anti-HuC (also known as anti-ELAVL3) or anti-HuD (anti-ELAVL4), 
Molecular Probes), which enabled us to determine the 
extent of the lesions. Most birds had well-placed lesions covering at 
least 66% of cluster N on both sides of the brain (mean ± s.d., 
78 ± 9%). In three of the 13 lesioned birds, less than 50% of 
cluster N was lesioned (no. 14: 63% of left side, 33% of right side; 
no. 16: 17% of left side, 24% of right side; no. 18: 35% of left side, 29% 
of right side). We note that these three birds oriented well under all 
three magnetic field conditions (N = 3; NMF: α = 27°, r = 0.82; 
CMF: α = 249°, r = 0.97; IMF: α = 184°, r = 0.81; mean vectors lie 
within the 99% confidence intervals of the mean orientation of the 
sham-lesioned birds under all magnetic field conditions; Fig. 2d–f, 
open circles). 

Figure 2 | Bilateral lesions of cluster N disrupt magnetic compass 
orientation in European robins. The photos show examples of brain sections 
from a sham-lesioned (top) and an actually lesioned (bottom) European 
robin, each sagittally cut through the centre of cluster N and stained with 
anti-HuC/HuD (a neuronal marker). The tissue where cluster N should have 
been in the lesioned bird is destroyed (compare with top photo). The 
drawings indicate where in the brain the photos were taken: A, arcopallium; 
E, entopallium; H, hyperpallium; DNH, dorsal nucleus of the hyperpallium; 
ICo, intercollicular complex; M, mesopallium (MD, mesopallium dorsale; 
MV, mesopallium ventrale); N, nidopallium; OT, optic tectum; P, pallidum; 
St, striatum. Rostral, left; caudal, right. Scale bar, 500 µm. a–c, 
Well-oriented, springtime, magnetic compass behaviour of the ten birds that 
received sham cluster N lesions (MPW, direction of magnetic pole for an 
inclination compass). d–f, Non-oriented magnetic compass behaviour of the 
13 cluster-N-lesioned birds. The data are means of 14 tests of each individual 
in the NMF, 13 tests in the CMF and 8 tests in the IMF. The open circles in 
d–f show the mean orientations of the three birds in which only 20%, 32% or 
47%, respectively, of cluster N was lesioned. The dashed arrows show the 
corresponding mean vectors. g, h, Sunset orientation (GN, geographical 
north) of the sham-lesioned (g) and lesioned (h) birds (14 tests per bird). 
The dashed radial line indicates the average sunset direction during the tests. 
Other details of the circular diagrams are as in Fig. 1. 

Our results, and the fact that cluster N is part of the visual 
system, appear to strongly support the hypothesis that magnetic 
compass input is processed in the visual system of night-migratory 
passerines. However, we consider two alternative explanations. The 
first is reduced general night-vision capability. Considering that 
cluster N is part of the visual Wulst and that it is known to process 
night-time light-dependent information, we tested whether a 
reduction of general night-vision capability could explain the 
difference in orientation performance between cluster-N-lesioned and 
sham-lesioned birds. First, we noticed that the cluster-N-lesioned 
birds also showed a high level of migratory restlessness in the funnels 
(346 ± 184 (mean ± s.d.) scratches per hour and per active test). In 
contrast, when we tested European robins in complete darkness, that is, 
when they were unable to see, they showed very little migratory 
restlessness (on average <20 scratches per hour). Second, Wulst lesions 
in pigeons (Columba livia) have been shown to affect the threshold 
for detecting the intensity of a dim point of light. Therefore, we 
performed operant conditioning tests in which European robins, one 
group with cluster N lesions and one group without, were trained to 
detect a dim point of light. Both groups could perform this visual 
discrimination task at light intensities 400 times dimmer than the light 
level under which the magnetic compass orientation tests were 
performed (the visual detection limit was <0.01 mW m−2 for both 
groups; Supplementary Fig. 1). We therefore conclude that a 
lesion-induced reduction in visual capability is unlikely to have caused the 
observed differences in orientation performance. 

The second alternative explanation is that cluster N might not spe- 
cifically be involved in the circuit processing magnetic compass 
information. Finding directions in a magnetic orientation experiment 
is a complex task involving coordination of information from mul- 
tiple neural pathways and probably input from memory. Lesions of 
cluster N could have affected neural processes required for motivation 
to migrate or for solving orientation tasks in general, thus leading to 
disorientation even though magnetic sensing itself remains intact. To 
test the specificity of cluster N in magnetic compass orientation, we 
tested 13 cluster-N-lesioned European robins and 14 sham-lesioned 
European robins for their orientation abilities under two conditions: 
outdoors during sunset and indoors under a stationary planetarium 
sky simulating the local starry sky (Oldenburg, Germany; Figs 2 
and 3). 

During natural sunset, both the sham-lesioned and cluster-N- 
lesioned birds oriented significantly towards the north-northwest 
(sham-lesioned group: α = 353° ± 38°, r = 0.57, N = 14, P < 0.01; 
cluster-N-lesioned group: α = 331° ± 39°, r = 0.57, N = 13, P = 0.01; 
Fig. 2g, h). The mean orientation of both groups indicates a comprom- 
ise direction between phototactic tendencies towards the setting sun 
and the birds’ north-northeast migratory direction. This reaction is 
typical of outdoor sun compass orientation tests on migratory birds 
during sunset. Importantly, the birds’ orientation was significantly 
more northerly than the sunset direction, meaning that pure phototac- 
tic orientation can be excluded (95% confidence intervals for the mean 
direction do not overlap with the sunset point, which on average was at 
278° during our experiments). 

In the planetarium, we simulated celestial north to be located at 
magnetic east so that we could determine whether orientation beha- 
viour was guided by a star compass or a magnetic compass. To 
encourage the birds to use their star compass, we added more mag- 
netic disturbance to the already significantly disturbed geomagnetic 
field inside the planetarium (Methods). The cluster-N-lesioned birds 
oriented significantly towards star north-northeast (α = 27° ± 44°, 
r = 0.55, N = 12, P = 0.02; Fig. 3a) and the mean orientation was 
almost identical to the direction chosen by the sham-lesioned birds 
using their magnetic compass (α = 25° ± 21°; Fig. 2a). These results 
show that birds with bilateral lesions of cluster N can use their star 
compass and their sunset compass, but cannot use their magnetic 
compass to perform appropriately directed migratory restlessness 
behaviour (Fig. 3). Therefore, a generally reduced motivation or 
inability to migrate cannot explain the disorientation of birds with 
cluster N lesions. 


Figure 3 | Birds with cluster N lesions can use their star and sun 
compasses, but not their magnetic compass. Photo from a lesioned bird as 
in Fig. 2. a, Birds with cluster N lesions tested in a planetarium simulating 
the local starry sky (ten tests per bird; STN, star north) oriented in the typical 
north-northeast spring migratory direction (α = 27° ± 44°, r = 0.55, 
N = 12, P = 0.02). b, Birds with cluster N lesions could not orient (α = 132°, 
r = 0.12, P > 0.70) using their magnetic compass. Shown are the combined 
data of Fig. 2d–f, depicted relative to the magnetic direction towards the pole 
(MPW) as European robins use an inclination compass (data from the three 
poorly lesioned birds are not included). c, Birds with cluster N lesions could 
also orient during sunset (α = 331° ± 39°, r = 0.57, N = 13, P = 0.01), 
presumably using their sun compass. 

Because the ophthalmic branch of the trigeminal nerve is the only 
nerve branch that innervates the candidate ferromagnetic, 
magnetosensory structures in the upper beak, our results show that these 
putative magnetoreceptors are neither necessary nor sufficient for 
magnetic compass orientation, and can therefore be excluded as 
the sole magnetic compass sensor in European robins. This 
conclusion is in line with results from other studies including, for instance, 
those on bobolinks, in which anaesthetic blockade of the trigeminal 
nerve also failed to affect compass orientation. However, our 
findings do not rule out the possibility that these or other putative 
magnetoreceptors in other regions of the body can sense geomagnetic 
information. In fact, in pigeons, the putative magnetosensors in 
the upper beak have been strongly implicated in the sensing of 
non-compass aspects of the geomagnetic field (but see also refs 28, 29). 

The results of the present study, together with those which show 
that cluster N is the most active forebrain region during magnetic 
compass orientation behaviour and is a specialized part of 
the visual system requiring light perceived through the eyes for 
its neuronal activation, specifically suggest that cluster N of 
European robins is an essential part of a circuit processing light- 
dependent magnetic compass information for night-time orienta- 
tion. The exact role of cluster N within this circuit has not been 
determined, but the present results raise the distinct possibility that 
this part of the visual system enables birds to ‘see’ magnetic compass 
information. 


METHODS SUMMARY 


All magnetic field conditions were produced using double-wrapped, 
three-dimensional Merritt four-coil systems with average coil diameters of 2 m. All 
experiments were performed within the central space of the coils, where the 
heterogeneities were <1% of the applied field. Current flowed through the coils 
in all magnetic conditions. 

The operations were performed under general anaesthesia. To lesion cluster N, 
50 nl 1% ibotenic acid in 0.9% NaCl were injected with a microinjector. To 
section V1, a small cut was made through the skin just above the eye and the 
eyeball and muscles were gently pushed to the side, so that V1 could be sectioned 
and a 2-3-mm piece of the nerve removed. The extent and degree of overlap 
between cluster N and the cluster N lesions were reconstructed post mortem 
using AMIRA software (Visage Imaging). 

The magnetic compass experiments were performed in Emlen funnels inside 
four wooden huts lined with grounded aluminium shields to minimize electro- 
magnetic disturbances. Immediately before the orientation tests, we exposed all 
test birds to parts of the local evening sky. 

The sun compass experiments were performed on clear evenings around the 
time of sunset on an open field. The star compass experiments were performed in 
a planetarium simulating the local starry sky of Oldenburg. The magnetic field 
inside the planetarium was strongly disturbed. 

Two to three independent observers determined the mean directions in the 
single tests, and all oriented and active tests were used to calculate the mean 
orientation of each individual bird under each experimental condition. These 
individual mean directions are depicted as circles on the peripheries of the large 
circles in Figs 1-3. The group mean vectors were calculated by vector addition of 
individual unit vectors in each of the individual bird mean directions and divi- 
sion by the number of birds tested. 
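
The vector-addition step described above can be sketched in a few lines of Python. Each bird contributes a unit vector in its individual mean direction; the sum is divided by the number of birds, giving the group mean direction (α) and mean vector length (r). The function name and the example bearings are hypothetical and are not data from the study. 

```python
import math


def group_mean_vector(directions_deg):
    """Return (alpha, r): direction in degrees and length of the group mean vector."""
    n = len(directions_deg)
    # Average the unit-vector components (vector addition divided by n).
    x = sum(math.cos(math.radians(d)) for d in directions_deg) / n
    y = sum(math.sin(math.radians(d)) for d in directions_deg) / n
    r = math.hypot(x, y)  # 0 = scattered in all directions, 1 = identical headings
    alpha = math.degrees(math.atan2(y, x)) % 360.0
    return alpha, r


# Hypothetical individual mean directions for six birds heading roughly north.
alpha, r = group_mean_vector([350.0, 5.0, 20.0, 10.0, 340.0, 30.0])
print(round(alpha, 1), round(r, 2))
```

A value of r close to 1 indicates that the individual birds agree closely on a direction, as the Figure 1 caption notes. 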


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Magnetic fields. To produce the CMF condition, current flowed in the same 
direction through each subset of windings of the double-wrapped, 
three-dimensional Merritt four-coil systems. To produce the natural geomagnetic 
field condition, the coils were turned on but the current ran through the two 
subsets of windings in opposite directions. Before each experiment, the actual 
magnetic field was measured (FVM-400, Meda) in the centre and at the edges of 
the experimental volume, within which nine Emlen funnels were placed 
simultaneously. The actual fields experienced by the birds under the three magnetic 
field conditions were as follows (mean ± s.d.): for the NMF the field strength was 
48,620 ± 330 nT, the inclination was 67.6° ± 0.5° and the horizontal direction 
was 360° ± 1°; for the CMF the field strength was 48,800 ± 320 nT, the 
inclination was 67.8° ± 0.5° and the horizontal direction was 240° ± 2°; and for the IMF 
the field strength was 48,660 ± 500 nT, the inclination was −68.1° ± 0.5° and the 
horizontal direction was 1° ± 1°. 
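
To make the relationship between the three field conditions concrete, the sketch below converts a field given as strength, inclination and horizontal direction into north, east and downward components and back again. It assumes the usual geomagnetic convention that positive inclination points downward; the function names are hypothetical. Under that assumption, turning the horizontal direction 120° anticlockwise from 360° gives 240° (CMF), and inverting only the vertical component flips the sign of the inclination while leaving strength and horizontal direction unchanged (IMF). 

```python
import math


def field_components(strength_nT, inclination_deg, direction_deg):
    """Return (north, east, down) components in nT; positive inclination points down."""
    inc = math.radians(inclination_deg)
    az = math.radians(direction_deg)
    horizontal = strength_nT * math.cos(inc)
    return (horizontal * math.cos(az),
            horizontal * math.sin(az),
            strength_nT * math.sin(inc))


def field_from_components(north, east, down):
    """Return (strength in nT, inclination in degrees, horizontal direction in degrees)."""
    strength = math.sqrt(north**2 + east**2 + down**2)
    inclination = math.degrees(math.atan2(down, math.hypot(north, east)))
    direction = math.degrees(math.atan2(east, north)) % 360.0
    return strength, inclination, direction


def turned_direction(direction_deg, turn_deg):
    """Rotate a horizontal direction; a negative turn is anticlockwise."""
    return (direction_deg + turn_deg) % 360.0


# Nominal NMF values from the text, then the same field with the vertical part inverted.
north, east, down = field_components(48620.0, 67.6, 360.0)
print(field_from_components(north, east, -down))
print(turned_direction(360.0, -120.0))
```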

Cluster N lesioning and nerve sectioning. Birds were anaesthetized by intra- 
muscular injection of ketamine (Pfizer)/Rompun (Bayer). The heads were then 
fixed in a custom-built stereotaxic apparatus. The scalp was additionally anaes- 
thetized using a local surface anaesthetic (xylocaine, Astra Zeneca), incised and 
retracted. For cluster N lesions, a small part of the skull was carefully removed 
and 50 nl 1% ibotenic acid in 0.9% NaCl solution were stereotaxically injected 
using a microinjector. The coordinates used corresponded to the ones described 
in ref. 18. Sham-lesioned birds underwent exactly the same procedures except 
that ibotenic acid was not injected. For nerve sectioning, the scalp was retracted 
and the fascia along the rim of the orbit was incised to allow gentle depression 
and retraction of the globe. The ophthalmic branch of the trigeminal nerve (V1) 
was revealed and sectioned behind the eyeball immediately before it left the orbit 
and 2-3 mm farther proximally; this piece was then removed and the cut ends of 
the proximal and distal stumps were sealed with surgical cyanoacrylate to pre- 
vent re-fusion. Sham-sectioned birds underwent exactly the same procedures 
except that the nerves were not sectioned. Finally, the skin edges were sealed and 
the birds were given 1-10 weeks to recover from surgery before taking part in any 
behavioural experiment. 

Behavioural experiments. The experimental birds were caught within 200 m of 
the testing site at the University of Oldenburg, Germany, during autumn migra- 
tion and tested during the following spring migratory season. The magnetic field 
orientation experiments were conducted in wooden huts placed on the univer- 
sity campus. The walls of the huts were lined with grounded aluminium shields, 
which acted as Faraday cages to shield non-stationary electromagnetic 
disturbances. One hour (±10 min) before the lights went out in the bird rooms (light 
for 14 h, dark for 10 h), the birds were placed outdoors in wooden transport cages 
that allowed them to see parts of the evening sky for 1 h to give them the 
opportunity to calibrate their magnetic compass from the local sunset sky. 
Immediately thereafter, the birds were placed in aluminium Emlen funnels 
(35 cm in diameter, 15 cm high, walls inclined at 45°) and tested for 1 h under 
dim light conditions (4 mW m−2) produced by incandescent bulbs (for 
spectrum, see Supplementary Fig. 2). 

The funnels were coated with thermal paper on which the birds left scratches 
as they moved. Nine European robins were tested simultaneously in each hut. 
The birds were put into a randomized funnel position each night, and were put 
into the funnels from different sides. We observed no systematic differences 
between the nine funnel positions. The birds were tested twice each night under 
the same magnetic field condition. There were no statistically significant 
differences (Mardia–Watson–Wheeler tests, P > 0.65 for all comparisons) or even 
indications that the orientation was systematically different during the first test 
relative to the second test, and because any given bird was tested in different huts 
during the first and second tests, both values were entered into the calculation of 
the mean for that bird under the given magnetic field condition. The first test 
started 30 min after sunset and the second test started around 2 h after sunset. 
The magnetic field condition present in a given hut was switched every second 
night, and usually different magnetic fields were present in different huts on any 
given night. 

Orientation experiments during sunset took place in spring on an open field 
near Gristede, 20 km north-northwest of Oldenburg. The tests were performed 
in the undisturbed local magnetic field with an open view of the sky on clear 
evenings and the test started 30 min before sunset and ended 30 min after sunset. 

Orientation experiments in the planetarium of Elsfleth, 28 km east-northeast 
of Oldenburg, were conducted under a stationary simulation of the local sky. The 
starry sky was projected using a Zeiss ZKP-2 projector on a dome (9 m in 
diameter). Nine Emlen funnels were placed symmetrically around the projector 
10 cm above the horizon plane, such that the projector was never visible from 
inside any of the funnels. To increase the likelihood that the magnetic field of the 
planetarium (already disturbed by the iron-containing projector and magnetic 
material in the walls) would not provide any useful magnetic compass informa- 
tion, heavy iron racks were used for stabilization of the set-up and strong neo- 
dymium magnets were attached under each funnel. The resulting magnetic field 
inside each funnel varied strongly in intensity (from ~44,000 nT to 
~84,000 nT), direction (owing to the central placement of the magnet, the 
horizontal direction of the field within the funnels varied in all directions 
depending on exactly where the bird was located within the funnel) and inclina- 
tion (from +59° to +88°). Other testing procedures were the same as in the 
magnetic field orientation experiments, except that the birds were only tested 
once per night. Before each testing session, the projector was adjusted to the time 
the birds were tested, and on every second day, the star pattern was adjusted to 
the actual date. 

Orientation-data analysis. Two independent researchers, who did not know 
either the test condition or the operation a given bird had experienced, deter- 
mined each bird’s mean direction from the distribution of the scratches. If both 
observers considered the scratches to be randomly distributed or if the two mean 
directions deviated by more than 30°, a third independent researcher determined 
the mean direction. If this third individual determined a mean direction similar 
to one of the first two, and if the individual with the initially differing opinion 
also agreed with this direction, the mean of the two similar directions was 
recorded as the orientation result. If the three independent individuals could 
not agree on one mean direction, the bird’s heading was defined as random and 
excluded from the analyses (only 10% of all tests were excluded on the basis of 
this criterion). Birds with fewer than 35 scratches on the paper were considered 
inactive and also excluded from the analysis (the birds were inactive in 18% of all 
tests). 
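
The decision rule described above can be sketched in code. The following Python fragment is a minimal illustration and not the authors' analysis software: it computes a circular mean direction and applies the 30° and 35-scratch criteria from the text. All function names are hypothetical, and treating a single "random" judgement as a disagreement is an assumption, because the text only specifies the case in which both observers judge the distribution to be random.

```python
import math

MAX_DEVIATION_DEG = 30.0  # from the text: a larger deviation calls for a third observer
MIN_SCRATCHES = 35        # from the text: fewer than 35 scratches means "inactive"


def circular_mean_deg(directions_deg):
    """Return the mean direction (0-360 degrees) of a list of compass directions."""
    s = sum(math.sin(math.radians(d)) for d in directions_deg)
    c = sum(math.cos(math.radians(d)) for d in directions_deg)
    return math.degrees(math.atan2(s, c)) % 360.0


def angular_difference_deg(a, b):
    """Return the smallest absolute difference between two directions (0-180 degrees)."""
    diff = abs(a - b) % 360.0
    return min(diff, 360.0 - diff)


def is_inactive(n_scratches):
    """A test with fewer than MIN_SCRATCHES scratches is excluded as inactive."""
    return n_scratches < MIN_SCRATCHES


def needs_third_observer(dir1, dir2):
    """dir1 and dir2 are mean directions in degrees, or None for 'randomly distributed'."""
    if dir1 is None or dir2 is None:
        # Assumption: one "random" judgement is handled like two of them.
        return True
    return angular_difference_deg(dir1, dir2) > MAX_DEVIATION_DEG
```

Under a strict reading of "more than 30°", two observers reporting 350° and 20° differ by exactly 30° and would not need a third opinion. The circular mean is used because an arithmetic mean of 350° and 10° would wrongly give 180° rather than north.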

Size and position of lesion analysis. To determine the exact extent and location 
of the lesions relative to cluster N, each brain slice was photographed with a 
digital camera (Leica DFC 320) through a stereo microscope (Leica M, Leica 
IM50). On these pictures, boundaries of the whole telencephalon, the injured 
tissue (lesion) and cluster N were marked, and the sections were aligned using 
Photoshop 6.0/Illustrator 10.0 (Adobe Systems). 

The extent of cluster N was determined by comparison with ZENK in-situ 
hybridized brain slices from birds performing magnetic compass orientation in a 
funnel’®'’. The stacks were aligned using the outline of the telencephalon. Stacks 
of each hemisphere were launched in AMIRA (Visage Imaging) and converted 
into AMIRA files (AMIRA mesh binary). The resolution of AMIRA files was 
reduced to 800 × 600 pixels and the physical distance between slides (the actual 
distance between each slide of each series) was set at 240 µm. The file sequence 
was fused into one data stack. ‘Label fields’ were created, in which the marked 
boundaries were labelled and interpolated into three three-dimensional bodies: 
the lesion, cluster N and the telencephalon. On the basis of the overlap in space 
between these three volumes, the percentage of the volumetric overlap between 
the lesion and cluster N was determined. This procedure was done for each 
hemisphere of each bird separately. 
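
The overlap calculation can be illustrated with a short sketch. This is not the AMIRA workflow itself: it assumes each labelled body has already been exported as a set of equally sized voxel coordinates, and it assumes the percentage is expressed relative to the volume of cluster N. The function name and the toy coordinates are hypothetical.

```python
def percent_overlap(lesion_voxels, cluster_voxels):
    """Percentage of the cluster volume that is covered by the lesion.

    Both arguments are iterables of (x, y, z) voxel coordinates on the same
    grid. With equally sized voxels, the voxel count is proportional to volume.
    """
    cluster = set(cluster_voxels)
    if not cluster:
        raise ValueError("cluster volume is empty")
    shared = cluster & set(lesion_voxels)
    return 100.0 * len(shared) / len(cluster)


# Toy example: a 4 x 4 x 2 block for the cluster, and a lesion shifted by two voxels in x.
cluster_n = {(x, y, z) for x in range(4) for y in range(4) for z in range(2)}
lesion = {(x, y, z) for x in range(2, 6) for y in range(4) for z in range(2)}
overlap = percent_overlap(lesion, cluster_n)
```

In this toy case half of the cluster voxels lie inside the lesion. The same function would be applied to each hemisphere of each bird separately.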

Regulation of cortical microcircuits by unitary 
GABA-mediated volume transmission 


Szabolcs Oláh, Miklós Füle, Gergely Komlósi, Csaba Varga, Rita Báldi, Pál Barzó & Gábor Tamás 


GABA (γ-aminobutyric acid) is predominantly released by local 
interneurons in the cerebral cortex to particular subcellular 
domains of the target cells’. This suggests that compartmentalized, 
synapse-specific action of GABA is required in cortical networks for 
phasic inhibition’ *. However, GABA released at the synaptic cleft 
diffuses to receptors outside the postsynaptic density and thus 
tonically activates extrasynaptic GABA_A and GABA_B receptors, 
which include subtypes of both receptor families especially sensitive 
to low concentrations of GABA*”. The synaptic and extrasynaptic 
action of GABA corroborates the idea that neurons of the brain use 
synaptic (or wiring) transmission and non-synaptic (or volume) 
transmission for communication*’. However, re-uptake mechan- 
isms restrict the spatial extent of extrasynaptic GABA-mediated 
effects’, and it has been proposed that the concerted action of 
several presynaptic interneurons, the sustained firing of individual 
cells or an increase in release-site density is required to reach ambi- 
ent GABA levels sufficient to activate extrasynaptic receptors*”"’. 
Here we show that individual neurogliaform cells release enough 
GABA for volume transmission within the axonal cloud and, thus, 
that neurogliaform cells do not require synapses to produce inhibi- 
tory responses in the overwhelming majority of nearby neurons. 
Neurogliaform cells suppress connections between other neurons 
acting on presynaptic terminals that do not receive synapses at all in 
the cerebral cortex. They also reach extrasynaptic, δ-subunit-containing 
GABA_A (GABA_Aδ) receptors responsible for tonic inhibition. 
We show that GABA_Aδ receptors are localized to neurogliaform 
cells preferentially among cortical interneurons. Neurosteroids, 
which are modulators of GABA_Aδ receptors, alter unitary GABA-mediated 
effects between neurogliaform cells. In contrast to the 
specifically placed synapses formed by other interneurons, the out- 
put of neurosteroid-sensitive neurogliaform cells represents the 
ultimate form of the lack of spatial specificity in GABA-mediated 
systems, leading to long-lasting network hyperpolarization com- 
bined with widespread suppression of communication in the local 
circuit. 

Uniquely among neocortical interneurons, neurogliaform cells 
evoke long-lasting inhibition in the form of an unusually slow 
GABA_A-receptor-mediated component and slow GABA_B-receptor-mediated 
responses in their target neurons’*”®. A distinctive feature 
of neurogliaform cells among cortical interneurons is that they have 
very dense axonal arborizations, in which presynaptic boutons on the 
same or neighbouring axon collaterals can be found a couple of micro- 
metres from each other’*’” (Fig. 1). GABA can activate receptors 
located up to several micrometres from the release site''. We measured 
the density of terminals in neurogliaform-cell (n = 8) and basket-cell 
(n = 6) axons (421,213 ± 34,289 and 78,506 ± 8,423 boutons per 
cubic millimetre, respectively; P < 0.0001) and found that a single 
neurogliaform-cell axon matches the potential release-site density of 


five or six overlapping basket-cell axons. We proposed that the high 
density of neurogliaform-cell axons could counteract transmitter re- 
uptake mechanisms and that GABA released from neurogliaform cells 
acts as a volume transmitter to reach receptors at synaptic and non- 
synaptic sites in the tissue the axon intermingles with. 
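
The "five or six overlapping basket-cell axons" comparison follows directly from the two mean densities quoted above. A minimal arithmetic sketch, using only the reported means and ignoring their spread:

```python
# Mean bouton densities reported in the text (boutons per cubic millimetre).
neurogliaform_density = 421_213
basket_density = 78_506

# Number of basket-cell axons needed to match one neurogliaform-cell axon.
density_ratio = neurogliaform_density / basket_density
print(f"{density_ratio:.1f} basket-cell axons per neurogliaform-cell axon")
```

The ratio falls between five and six, which is the range given in the text.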


Figure 1 | Neurogliaform cells do not require direct synaptic junctions to 
affect target cells. a, Dense axonal cloud formed by a single neurogliaform 
cell. b, Action potentials in neurogliaform cell (NGF) 1 elicited electrical 
coupling potentials combined with inhibitory postsynaptic potentials 
(IPSPs) in NGF 2. c, Route of the axon of NGF 1 (red) to close appositions 
(labelled 1–3) with the dendrites of NGF 2 (blue). None of these appositions 
could be verified as a synaptic junction. d, A non-synaptic close apposition 
(labelled 1 in c) in consecutive serial ultrathin sections (1a, axon of NGF 1; 
2d, dendrite of NGF 2). The axon of NGF 1 forms a synaptic junction 
(asterisk) on an unlabelled dendritic shaft. e, Three-dimensional electron 
microscopic reconstructions of 18 neurogliaform-cell axonal varicosities, of 
which only two formed synaptic junctions (arrows). 

Potential volume transmission suggests a very high rate of functional 
coupling between neurogliaform cells and neighbouring neurons. 
When searching our database, which contains information on 204 
simultaneously recorded pairs of neurogliaform cells and other neurons 
with somata located <100 µm apart, we detected hyperpolarizing 
effects of neurogliaform cells in 178 (87%) of tested cells, an unprece- 
dentedly high proportion in paired recordings of cortical neurons’®. In 
searching for the morphological correlates of these connections, corre- 
lated light and electron microscopic analysis was performed on n= 11 
neurogliaform-cell-to-interneuron pairs (eight from rat, of which three 
featured in ref. 19, and three from human) and n= 5 neurogliaform- 
cell-to-pyramidal-cell pairs (rat). 

We detected chemical synaptic junctions in only two neurogliaform- 
to-pyramidal pairs, supporting earlier results'*, but in assessing fully 
available series of ultrathin sections by tracing neurogliaform-cell 
axons along their approach to functionally coupled neurons, we did 
not find synaptic junctions in the remaining 14 cell pairs. In these 14 
pairs, the most closely placed synapses established by the boutons of 
the neurogliaform cells were 2.7 ± 1.6 µm from the target dendrites 
(range, 1.1–5.3 µm; Fig. 1). Proving the efficacy of our analysis, we 
confirmed the presence of gap junctions in the three pairs’? in which 
electrical coupling potentials were recorded in addition to IPSPs, even 
though ultrastructural identification of gap junctions between labelled 
neurons is more difficult than that of synapses (Fig. 1). Furthermore, 
we performed three-dimensional electron microscopic reconstruc- 
tions of randomly chosen segments of neurogliaform-cell axons. All 
examined axonal boutons contained synaptic vesicles, but we found 
that 50 boutons formed only 11 synapses, implying that the majority 
(~78%) of neurogliaform-cell axonal varicosities do not form classical 
synapses (Fig. 1). These results suggest that neurogliaform-cell axons 
do not necessarily require a synaptic contact to elicit inhibitory responses 
in target cells. 
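
The two percentages quoted in this part of the text can be reproduced from the raw counts. The sketch below assumes that each of the 11 synapses was formed by a different bouton, which the text does not state explicitly; the helper name is hypothetical.

```python
def percent(part, whole):
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


# Functional coupling: 178 of 204 tested cells showed hyperpolarizing effects.
coupled_percent = percent(178, 204)

# Ultrastructure: 50 reconstructed boutons formed only 11 synapses.
# Assumption: one synapse per bouton, so 50 - 11 boutons are non-synaptic.
non_synaptic_percent = percent(50 - 11, 50)

print(f"coupled: {coupled_percent:.0f}%, non-synaptic boutons: {non_synaptic_percent:.0f}%")
```

Both values agree with the rounded figures in the text (87% and about 78%).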

Although suspected non-synaptic communication by neuroglia- 
form cells is consistent with the lack of detectable synapses, positive 
evidence is required to prove the volume transmission hypothesis. 
Functional testing of single-cell-initiated non-synaptic signalling works 
best on potential targets not contacted by synapses at all. Thus, we asked 
whether GABA released from neurogliaform cells modulates axon 
terminals that are not targeted in synaptic junctions in the cerebral 
cortex” but frequently express GABA_B receptors'*!°*"~. We confirmed 
that neocortical neurogliaform cells modulate their own axon 
terminals by means of GABA_B receptors similar to hippocampal 
neurogliaform cells’® (Supplementary Fig. 1 and Supplementary Data), 
but the modulatory action of neurogliaform cells was not limited to 
homosynaptic silencing of axon terminals. 

Heterosynaptic or paracrine effects of neurogliaform cells on axons 
of other neurons were also suggested by experiments in which we 
simultaneously recorded from three neurons, namely a pyramidal- 
cell-to-interneuron connection (test excitatory postsynaptic poten- 
tials (EPSPs)) and a neighbouring neurogliaform cell activated 
60 ms before the first EPSP and 120 ms before the second (Fig. 2). 
Alternating trials with and without the activation of neurogliaform 
cells were performed at a very low frequency (once every 2 min) to 
avoid activity-dependent loss of neurogliaform-cell output'*®. As 
expected from the high rate of coupling of neurogliaform cells to other 
neurons, the decay phase of the hyperpolarizing effect of the neuro- 
gliaform cells overlapped with the test EPSPs; the corresponding input 
resistance changes of the neurons postsynaptic to the EPSPs were 
10 ± 5% and 4 ± 5%, measured 60 and, respectively, 120 ms after 
the spike in the neurogliaform cells. Accordingly, we corrected the 
amplitudes of the EPSPs and IPSPs reported below using the corres- 
ponding changes in input resistance and driving force. 

Switching on the action potential in the neurogliaform cells 60 ms 
before the spike in the pyramidal cell did not change the amplitude of 
the first test EPSPs (n = 5; 98 ± 4%) relative to control, that is, when 
the spike in the neurogliaform cell was not elicited (Fig. 2). This 
indicates that tonic inhibition through GABA_A receptors potentially 

Figure 2 | Heterosynaptic or paracrine effects of neurogliaform cells on 
axons of other neurons. a, Single neurogliaform cells (NGFs) 
heterosynaptically modulate unitary glutamatergic connections linking 
other neurons. These simultaneous triple records show a pyramidal-cell-to- 
pyramidal-cell (PYR 1-to-PYR 2) connection (test EPSPs) while the output 
of a neurogliaform cell is switched on and off 60 ms before the first pyramidal 
spike. The neurogliaform cell suppressed the amplitude of the second EPSP 
evoked by PYR 1, but not the first. b, Activation-sequence-dependent cross- 
modulation of unitary IPSP amplitudes between closely located 
neurogliaform cells. Top: activation of NGF 1 or NGF 2 followed by a spike in 
NGF 2 or, respectively, NGF 1 resulted in sequential IPSPs in the 
postsynaptic pyramidal cell. Bottom: comparison of the amplitude of 
preceding and succeeding IPSPs elicited by NGF 1 or NGF 2 indicates 
effective suppression of follower IPSPs. 


activated by neurogliaform cells'*’* did not interfere significantly 
with test EPSPs other than by contributing to input resistance 
changes. However, the neurogliaform cells were effective in decreas- 
ing the amplitude of the second test EPSPs, timed 120 ms after the 
spike in the neurogliaform cells, to 77 ± 5% (P < 0.03) of control 
(Fig. 2). The differential action of neurogliaform-cell output on the 
first and the second test EPSPs induced a change in their paired pulse 
ratio (from 85 ± 20% to 69 ± 22%; P < 0.01). Moreover, neurogliaform 
cells activated 120 ms before test IPSPs triggered by other 
neurogliaform cells successfully suppressed the amplitude of the test 
IPSPs to 74 ± 4% (n = 10, P < 0.02) of control (Fig. 2). This resulted 
in an unusual activation-sequence-dependent cross-modulation of 
IPSP amplitudes between closely located neurogliaform cells. 
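
The link between the selective suppression of the second EPSP and the change in paired pulse ratio can be checked with a rough calculation. The sketch below assumes the conventional definition of the paired pulse ratio (second amplitude divided by first amplitude), which the text does not spell out, and it works with the reported group means only. Because the published values are averages over individual cell pairs, a ratio of means need not equal the mean of the ratios, so only approximate agreement should be expected.

```python
def paired_pulse_ratio(first_amplitude, second_amplitude):
    """Second response amplitude divided by the first (conventional definition)."""
    if first_amplitude == 0:
        raise ValueError("first amplitude must be non-zero")
    return second_amplitude / first_amplitude


# Reported group means from the text.
control_ratio = 0.85   # paired pulse ratio without the neurogliaform spike
first_scale = 0.98     # first EPSP, as a fraction of control
second_scale = 0.77    # second EPSP, as a fraction of control

# Hypothetical control amplitudes (arbitrary units) chosen to give the control ratio.
control_first = 1.0
control_second = control_ratio * control_first

predicted_ratio = paired_pulse_ratio(first_scale * control_first,
                                     second_scale * control_second)
print(f"predicted paired pulse ratio: {predicted_ratio:.2f}")
```

The predicted value is close to the reported 69%, which is consistent with a suppression acting mainly on the second, appropriately delayed response.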

Thus, the experiments on test EPSPs and IPSPs showed that indi- 
vidual action potentials in neurogliaform cells could suppress appro- 
priately delayed responses elicited by other neurons. These effects were 
enhanced by NO-711, a GABA re-uptake blocker (Supplementary 
Fig. 2 and Supplementary Data). We also asked if basket cells could 
modulate the surrounding microcircuit similarly to neurogliaform 
cells, but single or multiple spikes in these interneurons failed to 
modulate the amplitude of test connections (Supplementary Fig. 3 
and Supplementary Data). Furthermore, we found that GABA_B receptors 
located on presynaptic terminals” are necessary and sufficient for 
the neurogliaform-cell-elicited heterosynaptic suppression of test 
connections, and supplementary anatomical analysis showed that 
the modulatory effect of GABA released from neurogliaform cells 
appears to be confined to the axonal cloud (Supplementary Fig. 4 
and Supplementary Data). 

Apart from presynaptic GABA_B receptors, extrasynaptically placed 
GABA_Aδ receptors” ”’ are potential targets for non-synaptically acting 
GABA released by neurogliaform cells. Tonic inhibition mediated by 
GABA_Aδ receptors is variably present on cortical neurons**. We 
therefore applied immunocytochemistry, which showed that among 
cortical interneurons, neurogliaform cells are primary candidates for 
GABA_Aδ-receptor expression (Supplementary Figs 5 and 6 and 
Supplementary Data). Immunolabelling of electrophysiologically 
and anatomically identified cells confirmed that neurogliaform cells 
somatodendritically express GABA_Aδ receptors (n = 9) and that 
some (n = 3) of the neurogliaform cells that did so elicited slow 
IPSPs in simultaneously recorded GABA_Aδ-receptor-immunonegative 
interneurons (Fig. 3). 

Tonic inhibition mediated by GABA_Aδ receptors seems to be a target 
for stress-induced and ovarian-steroid-derived neurosteroids**”®”’; 
thus, these compounds should modulate the excitability of neurogliaform 
cells. The average holding current necessary to clamp neurogliaform 
cells (n = 7) but not other interneurons (n = 10; Supplementary 
Fig. 7; ref. 26) at a given holding potential increased by 24 ± 5 pA 
(P < 0.03) after the addition of the neurosteroid tetrahydrodeoxycorticosterone 
(THDOC; 100 nM)**”°” during blockade of GABA_B 
receptors with 40 µM CGP35348 in the presence of 5 µM GABA 
(Fig. 3). The effect of THDOC was reversed during blockade of 
GABA_A receptors with gabazine (10 µM): average holding currents 
decreased by 8 ± 3 pA (P < 0.05) after addition of THDOC. 
Furthermore, rheobasic firing of neurogliaform cells (n = 6; Fig. 3) 
but not other interneurons (n = 8; Supplementary Fig. 7) required 
larger positive current injections (234 ± 26 pA) in the presence of 
THDOC (20 nM), relative to baseline conditions (138 ± 17 pA; 
P < 0.01; 40 µM CGP35348, 5 µM GABA), and the effect was abolished 
with gabazine (10 µM), indicating that GABA_Aδ receptors could effectively 
control the input-output gain’**” of neurogliaform cells. 

In our test, we used low concentrations of externally added GABA 
mimicking ambient cortical levels, as in earlier experiments on 
neurosteroid modulation of tonic inhibition**”°’. However, neurogliaform 
output could locally produce extracellular GABA concentrations 
effective on GABA_Aδ receptors without external input. 
Application of THDOC (100 nM) in the presence of CGP35348 
(40 µM) increased the half-width of gabazine (10 µM)-sensitive 
IPSPs elicited by single presynaptic action potentials in reciprocally 
connected pairs of neurogliaform cells from 48 ± 6 ms to 59 ± 10 ms 
(n = 5, P < 0.04; Fig. 3). Neurosteroid modulation of self-inhibition 
and intercellular inhibition involving neurogliaform cells confirms 
that neurogliaform cells are both sources and targets of extracellular 
GABA acting on tonic inhibition. 

Unlike other interneuron types that specifically place synapses on 
particular compartments of postsynaptic cells'’*, neurogliaform cells 
provide non-synaptic, spatially non-specific input to the entire surface 
of target cells, complementing conventional synapses". Neurogliaform 
cells release GABA, covering their axonal fields in effective concentra- 
tions, and target the overwhelming majority of nearby neurons, which 
selectively express receptors sensitive to low concentrations of the 
neurotransmitter on their various compartments**”**. Although pre- 
synaptic mechanisms producing the GABA cloud around neuroglia- 
form axons are not understood, they might involve a unique release 
mechanism with an unconventional calcium dependence'®”’. Provided 
that release and re-uptake mechanisms are similar, local GABA 
concentrations produced by distinct interneurons probably emerge 

Figure 3 | Extrasynaptically placed GABA_Aδ receptors are localized to 
neurogliaform cells and targeted by GABA released from neurogliaform 
cells. a, Left: GABA_Aδ-receptor immunoreaction on a simultaneously 
recorded and biocytin-filled neurogliaform cell (NGF) and postsynaptic 
interneuron (INT). GABA_Aδ receptors were detected on the neurogliaform 
cell only. Middle: firing pattern of the interneuron and neurogliaform cell. 
Right: the neurogliaform cell elicited slow IPSPs in the postsynaptic 
GABA_Aδ-receptor immunonegative interneuron. b–d, Neurosteroids alter 
the excitability and connections of neurogliaform cells through GABA_Aδ 
receptors. b, The average current (dashed lines) required to hold a 
neurogliaform cell at the same membrane potential changed after the 
addition of the neurosteroid THDOC (100 nM) while GABA_B receptors were 
blocked using 40 µM CGP35348 in the presence of 5 µM GABA. When 
gabazine (10 µM) was introduced, the effect of THDOC was reversed: the 
average holding current was smaller than that measured before THDOC 
addition. c, Rheobasic firing of a neurogliaform cell required larger positive 
current injections (220 pA instead of 140 pA) in the presence of THDOC 
(20 nM) than under baseline conditions (40 µM CGP35348, 5 µM GABA), 
and the effect was abolished with gabazine (10 µM). d, Application of 
THDOC (100 nM) in the presence of CGP35348 (40 µM) increased the half-width 
of gabazine (10 µM)-sensitive IPSPs elicited by single presynaptic 
action potentials in reciprocally connected pairs of neurogliaform cells. 


at distances of about half the inter-terminal distance, meaning that 
basket cells should be less effective than neurogliaform cells around 
3 µm from their terminals''. The spatial extent of axons'*'””’ suggests 
that neurogliaform cells provide a means of making synchronized 
changes in the efficacy of synaptic connections in conjunction with 
regulating dendritic excitability across distances of around 200 µm. 
In certain operational states of the microcircuit, solitary spikes in a 
single neurogliaform cell might replace the concerted action poten- 
tials of interneuron populations in modulating presynaptic terminals 
and postsynaptic domains expressing GABA receptors**?'"), 
Neurosteroids might shift the balance among the sources of ambient 
GABA by lowering the contribution of neurogliaform cells with a 
selective increase in tonic inhibition through GABA_Aδ receptors. 
Varying neurosteroid concentrations during the ovarian cycle and 
pregnancy and as a result of stress*”” are expected to modulate the 
action of neurogliaform cells on network hyperpolarization and in 
suppressing communication in the local circuit acting on axons of 
resident neurons or terminals of long-range projections at their 
arrival. 


METHODS SUMMARY 


All procedures were performed with the approval of the University of Szeged and 
in accordance with the US National Institutes of Health Guide to the Care and 
Use of Laboratory Animals. We obtained electrophysiological data at ~35 °C 
from up to four concomitantly recorded cells visualized in layer 2/3 of the somatosensory 
cortex of Wistar rats (P22–P35; P, postnatal day) as described previously’. 
Presynaptic neurogliaform cells and other cell types were stimulated to 
elicit action potentials with brief (2-ms) suprathreshold pulses at intervals of 
>120 s to avoid exhaustion of transmission. We corrected the amplitudes of 
postsynaptic potentials riding on top of the decay phase of preceding IPSPs using 
changes related to input resistance and driving force (2 ± 1% for IPSPs and 
0.5 ± 0.2% for EPSPs). Data are given as mean ± s.d. The Wilcoxon test and 
Mann-Whitney U-test were used to compare data sets. Differences were 
accepted as significant if P < 0.05. 

Visualization of biocytin and correlated light and electron microscopy were 
performed as described previously'*'*. Three-dimensional light and electron 
microscopic reconstructions were carried out using NEUROLUCIDA 
(MicroBrightfield) and RECONSTRUCT (SYNAPSE WEB) software (version 1.1). 
Using three-dimensional reconstructions and/or serial ultrathin sections, 
synaptic junctions were defined as 20–25-nm-wide rigid appositions 
between pre- and postsynaptic profiles with an accumulation of presynaptic 
vesicles in axon terminals. The absence of either of these criteria around pre- 
synaptic boutons was used to classify a particular axon terminal without 
synapses. 

Immunocytochemistry was performed on adult Wistar rats (n = 5) and 
GABA_A-receptor δ-subunit −/− mice (n = 2; donated by I. Mody, University 
of California, Los Angeles) with standard methods (Methods). For immunoreactions 
concerning δ-subunits of GABA_A receptors, a primary rabbit polyclonal 
antibody was used (1:500; gift from W. Sieghart, Center for Brain Research, 
Vienna, Austria). Co-localizations were performed afterwards with primary 
antibodies as described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 
Electrophysiology. All procedures were performed with the approval of the 
University of Szeged and in accordance with the US National Institutes of 
Health Guide to the Care and Use of Laboratory Animals. Wistar rats (P22–P35) 
were anaesthetized by the intraperitoneal injection of ketamine (30 mg kg⁻¹) 
and xylazine (10 mg kg⁻¹), and following decapitation, coronal slices (350 µm 
thick) were prepared from the somatosensory cortex. Slices were incubated at 
room temperature (25 °C) for 1 h in a solution composed of 130 mM NaCl, 
3.5 mM KCl, 1 mM NaH2PO4, 24 mM NaHCO3, 1 mM CaCl2, 3 mM MgSO4 and 
10 mM D(+)-glucose, saturated with 95% O2 and 5% CO2. The solution used 
during recordings differed only in that it contained 3 mM CaCl2 and 1.5 mM 
MgSO4. Recordings were obtained at ~35 °C from up to four concomitantly 
recorded cells visualized in layer 2/3 by infrared differential-interference-contrast 
video microscopy (Olympus BX60WI microscope, Hamamatsu CCD camera, 
Luigs & Neumann Infrapatch set-up and two HEKA EPC 10 Double patch-clamp 
amplifiers). Micropipettes (5–7 MΩ) were filled with 126 mM K gluconate, 4 mM 
KCl, 4 mM ATP-Mg, 0.3 mM GTP-Na2, 10 mM HEPES, 10 mM creatine phosphate 
and 8 mM biocytin (pH 7.25, 300 mosmol). Signals were filtered at 5 kHz, 
digitized at 10 kHz and analysed with PULSE software (version 8.54; HEKA). 
Presynaptic neurogliaform cells and other cell types were stimulated to elicit 
action potentials with brief (2-ms) suprathreshold pulses at intervals of >120 s to 
avoid exhaustion of transmission. Postsynaptic cells were held at a membrane 
potential of −51 ± 4 mV. Unless specified otherwise, traces shown are averages 
of 30-200 episodes. The amplitude of postsynaptic response was defined as the 
difference between the peak amplitude and the baseline value measured before 
onset of the postsynaptic potential in control experiments (that is, when addi- 
tional preceding spikes in the neurogliaform or fast-spiking basket cells were not 
elicited). These amplitudes were compared with postsynaptic averages recorded 
with the preceding presynaptic spikes elicited in the neurogliaform or fast spik- 
ing basket cells. Amplitudes of postsynaptic potentials riding on top of the decay 
phase of preceding IPSPs were corrected using the input-resistance changes 
measured with brief hyperpolarizing pulses timed on top of the IPSPs at 60 
and 120 ms after the spike in the neurogliaform or fast-spiking basket cells. 
Data were also corrected for driving-force-related changes, which were 2 ± 1% 
for IPSPs and 0.5 ± 0.2% for EPSPs. Experiments with THDOC were performed 
in the presence of NBQX (10 µM) and APV (20 µM). Effects between neurogliaform 
cells could not be adequately fitted with single exponentials; thus, half-widths 
were compared. Data are given as mean ± s.d. The Wilcoxon test and 
Mann-Whitney U-test were used to compare data sets; differences were accepted 
as significant if P < 0.05. 
Histology. Visualization of biocytin and correlated light and electron micro- 
scopy was performed as described previously'*'*. Three-dimensional light 
microscopic reconstructions were carried out using NEUROLUCIDA (version 
4.05; MicroBrightfield) with a ×100 objective; measurements of the overlap 
between axonal and dendritic fields were helped using NEUROEXPLORER 
(version 3.23; MicroBrightfield) software. Three-dimensional electron micro- 
scopic reconstructions were performed with RECONSTRUCT (SYNAPSE 
WEB; version 1.1) software axons and were aided by the especially large range 
(±80°) of the goniometer fitted to our electron microscope (Tecnai BioTwin 
120). Using three-dimensional reconstructions and/or serial ultrathin sections, 
synaptic junctions were defined as 20-25-nm-wide rigid appositions between 
pre- and postsynaptic profiles with an accumulation of presynaptic vesicles in 
axon terminals. The absence of either of these criteria around presynaptic 
boutons was used to classify a particular axon terminal without synapses. 
Immunocytochemistry. Adult Wistar rats (n = 5) and GABA_A-receptor δ-subunit 
−/− mice (n = 2) were perfused with fixative containing 4% paraformaldehyde in 
0.1 M phosphate buffer (PB, pH 7.3) for 10 min in deep anaesthesia. For immunoreactions 
concerning δ-subunits of GABA_A receptors, 60-µm-thick coronal sections 
were incubated in citrate buffer containing 10 mM citric acid and 0.05% Tween 20 
(Sigma) in distilled water (pH 6.0) at 95–100 °C for 10 min. After cooling to room 
temperature, sections were blocked with normal horse serum (NHS, 10%) in Tris-buffered 
saline (TBS, pH 7.4) for 1 h and incubated with primary rabbit polyclonal 
antibody (1:500) diluted in TBS containing 2% NHS and 0.1% Triton X-100 for 
72 h at 4 °C. Washes were done between steps with TBS containing 0.05% Tween 20 
until incubation in a cocktail containing biotinylated donkey anti-rabbit secondary 
antibody (1:250; Jackson Immunoresearch), and thereafter with TBS only. Sections 
were treated with ABC complex dissolved in TBS (1:100; Vector Laboratories) and 
GABA_Aδ-receptor immunoreactions were visualized using Alexa 488-conjugated 
tyramide signal amplification kits (Molecular Probes). 

Co-localizations were performed afterwards with the following primary 
antibodies: mouse anti-α-actinin (1:40,000; Sigma), goat anti-parvalbumin 
(1:5,000; PGV-214, Swant), mouse anti-calbindin (1:100; C8666, Sigma), guinea 
pig anti-vasoactive intestinal peptide (1:200; T-5030, Peninsula Laboratories), 
mouse anti-calretinin (1:1,000; 6B3, Swant), rat anti-somatostatin (1:50; 
MAB354, Chemicon), mouse anti-chicken ovalbumin upstream promoter tran- 
scription factor II (1:500; 2ZH7147H, PPMX) and mouse anti-reelin (1:50,000; 
MAB5366, Chemicon). Primaries were diluted in the cocktail described above 
and were visualized using the following secondaries: Cy3-conjugated donkey 
anti-mouse (1:500; Jackson Immunoresearch), Cy3-conjugated donkey anti- 
rabbit (1:500; Jackson Immunoresearch), Cy3-conjugated donkey anti-rat 
(1:500; Jackson Immunoresearch), Cy5-conjugated donkey anti-guinea pig 
(1:500; Jackson Immunoresearch) and Alexa 350-conjugated donkey anti-goat 
(1:500; Molecular Probes). Finally, sections were mounted on slides in Vectashield 
(Vector Laboratories). Images were made using a light microscope (BX60, 
Olympus) with a ×5 objective or a confocal laser scanning microscope (IX81, 
Olympus) with a ×20 (numerical aperture, 0.75) or a ×40 (numerical aperture, 
1.30) objective. Automated sequential acquisition of multiple channels was used. 
Z-stack images were made up from 3–9 images in 5–45 µm depth of tissue. 

Regulation of inflammatory responses by gut 
microbiota and chemoattractant receptor GPR43 


Kendle M. Maslowski, Angelica T. Vieira, Aylwin Ng, Jan Kranich, Frederic Sierro, Di Yu, Heidi C. Schilter, 
Michael S. Rolph, Fabienne Mackay, David Artis, Ramnik J. Xavier, Mauro M. Teixeira & Charles R. Mackay 


The immune system responds to pathogens by a variety of pattern 
recognition molecules such as the Toll-like receptors (TLRs), which 
promote recognition of dangerous foreign pathogens. However, 
recent evidence indicates that normal intestinal microbiota might 
also positively influence immune responses, and protect against the 
development of inflammatory diseases'*. One of these elements 
may be short-chain fatty acids (SCFAs), which are produced by 
fermentation of dietary fibre by intestinal microbiota. A feature 
of human ulcerative colitis and other colitic diseases is a change 
in ‘healthy’ microbiota such as Bifidobacterium and Bacteroides, 
and a concurrent reduction in SCFAs*. Moreover, increased intake 
of fermentable dietary fibre, or SCFAs, seems to be clinically bene- 
ficial in the treatment of colitis”°. SCFAs bind the G-protein- 
coupled receptor 43 (GPR43, also known as FFAR2)'°", and here 
we show that SCFA—GPR43 interactions profoundly affect inflam- 
matory responses. Stimulation of GPR43 by SCFAs was necessary 
for the normal resolution of certain inflammatory responses, 
because GPR43-deficient (Gpr43−/−) mice showed exacerbated or 
unresolving inflammation in models of colitis, arthritis and 
asthma. This seemed to relate to increased production of inflammatory 
mediators by Gpr43−/− immune cells, and increased 
immune cell recruitment. Germ-free mice, which are devoid of 
bacteria and express little or no SCFAs, showed a similar dysregu- 
lation of certain inflammatory responses. GPR43 binding of SCFAs 
potentially provides a molecular link between diet, gastrointestinal 
bacterial metabolism, and immune and inflammatory responses. 
Recent evidence suggests that products of intestinal microbiota might 
positively influence inflammatory disease pathogenesis'”. To identify 
factors produced by bacteria that might be beneficial to host immune 
responses, we induced colitis chemically by adding dextran sulphate 
sodium (DSS) to the drinking water of germ-free mice. The absence of 
bacteria from these mice, and the consequences this had on immune 
responses, resulted in significantly worse colonic inflammation, com- 
pared to conventionally raised (CNV) mice. Germ-free mice treated with 
DSS had decreased weight, an increased daily activity index (DAI; a 
combined measure of weight loss, rectal bleeding and stool consistency) 
and a decreased haematocrit (Fig. 1a). Germ-free mice re-colonized with 
gut microbiota, by gavaging with CNV faeces, showed a marked reduc- 
tion in inflammation (Supplementary Fig. 1), indicating that the exacer- 
bated response related to lack of bacterial colonisation of the gut. 
Products of bacteria that have been reported to show anti-inflammatory 
properties include SCFAs, which are produced by colonic bacteria after 
fermentation of dietary fibre. Bacteria of the Bacteroidetes phylum produce
high levels of acetate and propionate, whereas bacteria of the
Firmicutes phylum produce high amounts of butyrate’*. Germ-free mice
do not produce SCFAs owing to a lack of enteric microbes’*. Treatment 
of germ-free mice with 150 mM acetate in the drinking water markedly 
improved disease indices, with an increase in colon length, decreased DAI 
and levels of the inflammatory mediator myeloperoxidase (MPO) 
Figure 1 | Exacerbated colitis in germ-free mice is ameliorated by acetate.
a, Germ-free (open squares) and CNV (closed triangles) mice were given 
DSS colitis (4%), n = 7 (experimental groups). Dashed lines, control mice; 
solid lines, DSS-treated mice. The percentage weight change (left), DAI 
(middle) and haematocrit (right) were measured. b, Germ-free mice were fed 
acetate (grey squares; 150 mM; n = 3) in the drinking water or water only 
(black squares; n = 5), 5 days before and during DSS administration. 
Control fed denotes no DSS (open squares). Daily activity score (left), colon 
length (middle) and colonic MPO (right) were determined. c, MIP1α and
TNFα levels in acetate-fed mice. Data are median ± s.e.m., representative of
two independent experiments. 
(Fig. 1b), and decreased levels of TNFα and inflammatory MIP1α
(also known as CCL3) (Fig. 1c). SCFAs have a well-characterized anti- 
inflammatory effect, on both colonic epithelium and immune cells'‘*””. 
Recently, SCFAs, particularly acetate (C2) and propionate (C3), have 
been found to bind and activate the G-protein-coupled receptor GPR43 
(refs 10, 11). Using an extensive data set of human and mouse immune 
cell transcription profiles'* we found that transcripts for human GPR43 
and mouse Gpr43 exhibited enhanced expression in neutrophils and 
eosinophils (Fig. 2a and Supplementary Fig. 2). Using nearest-neighbour 
Figure 2 | GPR43 expression and role in inflammatory responses.

a, Immune expression signature of genes encoding cellular receptors across a 
large panel of leukocyte subsets. Clustering of receptor genes exhibiting 
enriched expression in neutrophils and eosinophils reveals GPR43, along 
with other receptors important for innate immunity and chemoattractant- 
induced responses. Correlation analysis across a wider set of genes in this 
immune panel identified a rank-ordered list of the top 150 genes (N150) in
the co-expression neighbourhood of GPR43. b-e, Comparison of wild-type
(WT) and Gpr43-/- bone marrow neutrophils with respect to acetate-induced
Ca2+ flux (b; MFI, mean fluorescence intensity), chemotaxis (c, left
panel), ROS production (d), and phagocytosis of fluorescently labelled 
correlation analysis we found that GPR43 gene expression was closely
regulated with receptors important for innate immunity, such as Toll- 
like receptors (TLR2 and TLR4), formyl peptide receptors (FPR1 and 
FPR2), IL8RB (also known as CXCR2) and C5aR (Fig. 2a). We con- 
structed protein interaction networks for genes co-regulated with 
GPR43, and, using a molecular complex detection and a graph-theory- 
based clustering (MCODE) algorithm”, we identified densely connected 
local sub-network clusters, revealing modules associated with apoptosis 
and innate immunity-related processes (Supplementary Fig. 3). 
phenylacetamide 1 in human neutrophil chemotaxis. f, DSS colitis (2.5% (w/v))
in wild-type and Gpr43-/- mice, fed with acetate or control water (Ctrl).
Shown are colon length, the DAI, histology score and colon MPO levels. NS,
not significant. The far right panels show representative histological sections
from wild-type or Gpr43-/- mice as indicated (scale bar, 50 µm). g, Chronic
DSS-induced colitis (n = 7 per group, median ± s.e.m.). The inset shows the
percentage morbidity. Shown are the percentage change in weight, colon
and Gpr43-/- mice.
To demonstrate that GPR43 was the relevant receptor for SCFA
effects on immune cells, we sourced Gpr43-/- mice (Supplementary
Fig. 4), and found that T- and B-cell numbers were normal, and their
blood neutrophil numbers were in the normal range (not shown).
Acetate induced a robust calcium flux in mouse neutrophils (as well
as in human neutrophils and eosinophils, not shown), but not in
neutrophils from Gpr43-/- mice (Fig. 2b), indicating that GPR43
is the sole functional receptor for SCFAs on neutrophils. GPR43 has
previously been reported to act as a chemoattractant receptor for
acetate (ref. 11), and we found that neutrophils from wild-type, but not
Gpr43-/- mice, responded chemotactically to acetate, but only at
very high concentrations (~100-1,000 µM), and with a relatively
low chemotactic index (Fig. 2c). This may relate to the low affinity
of GPR43 for SCFAs”, the high concentrations of SCFAs normally
present in tissues (0.1-10 mM in serum, and 200 mM in the
colon)'**, and the need for chemoattractant receptors to sense a
gradient. However, a recently reported synthetic agonist specific
for human GPR43 with >100-fold potency over SCFAs” was much
more robust in chemotaxis assays (Fig. 2c). In neutrophils from wild-type
mice, but not Gpr43-/- mice, SCFAs induced release of ROS
and increased phagocytic activity (Fig. 2d, e) similar to activities
reported for certain other chemoattractant receptors.

The characteristic expression pattern of GPR43 in cell types
involved in innate immunity and inflammation, and the GPR43-dependent
effects of SCFAs on neutrophil function, suggested that
GPR43 may be the relevant receptor on immune cells for the regulation
of inflammatory responses by SCFAs. We induced colitis by
adding DSS to the drinking water of Gpr43-/- and wild-type littermate
mice. In the acute phase (7 days), Gpr43-/- mice showed a
marked increase in their inflammatory response, compared to
wild-type mice (Fig. 2f), including a reduced colon length, an
increased DAI, more severe inflammation by histological analysis
and increased MPO activity in the colon, indicating increased neutrophil
infiltration/activation (Fig. 2f). We next determined whether
acetate protected against colitis in the acute DSS model, in a GPR43-dependent
manner. Mice fed 200 mM acetate in their drinking water
showed a substantial decrease in inflammation, as judged by longer
colon length, a reduced DAI, reduced inflammatory infiltrate and less
tissue damage, when compared to wild-type mice not fed acetate
(Fig. 2f). Notably, this protection occurred by acetate binding to
GPR43, because acetate had no beneficial effect in Gpr43-/- mice
(Fig. 2f).

In a chronic model of DSS-induced colitis, Gpr43-/- mice showed
greater morbidity (at day 8, Fig. 2g, inset), and a marked reduction in
the ability to regain weight compared to wild-type littermates
(Fig. 2g). By day 42, Gpr43-/- mice showed reduced colon length,
increased colon histology score, as well as increased MPO levels in the
colon. In a T-cell-dependent model of colitis, the TNBS (trinitrobenzene
sulphonic acid)-induced model, Gpr43-/- mice also had more
severe disease with decreased colon length and increased colon histological
scores (Supplementary Fig. 5). CD44+ IL17+ T cells in the
mesenteric lymph nodes in this model were increased in Gpr43-/-
mice, as were transcripts for IL17A, IL6, IL1B, IFNγ and CCL2 in
colon tissue (Supplementary Fig. 5).

GPR43 is also expressed on colonic epithelium, and SCFAs affect
several functions of these cells including proliferation and epithelial
barrier function”', and butyrate is a major source of energy for colonocytes.
We determined the contribution of immune and non-immune
cells to the colitis phenotype we observed in Gpr43-/- mice
by using bone marrow chimaeras (Supplementary Fig. 6). After
7 weeks of bone marrow reconstitution, colitis was induced by DSS
in the drinking water. Wild-type mice reconstituted with Gpr43-/-
bone marrow showed a similar exacerbated inflammatory response
in the colon, compared to Gpr43-/- mice receiving Gpr43-/- bone
marrow, demonstrating that immune cells were largely responsible
for the phenotype observed in Gpr43-/- mice.
SCFA levels in the colon can be high, particularly after ingestion of
large amounts of fibre, and SCFAs are rapidly absorbed and distribute
systemically through the blood”. We therefore assessed peripheral
inflammatory responses in Gpr43-/- mice using the K/BxN serum-induced
model of inflammatory arthritis and an ovalbumin (OVA)-induced
model of allergic airway inflammation. In the inflammatory
arthritis model Gpr43-/- mice showed a slight delay in the onset of
symptoms, but by day 11 inflammation in Gpr43-/- mice was markedly
more severe than in wild-type littermates, clinically and histologically,
and did not resolve over the 28 days of study (Fig. 3a). Gpr43-/- mice
showed increased levels of MPO production by peripheral blood cells
(Fig. 3a). Acetate in the drinking water 1 week before, and during, the
induction of inflammatory arthritis reduced inflammation (Supplementary
Fig. 7a). Germ-free mice given K/BxN serum also showed
increased inflammation and a much slower resolution of inflammation
when compared to CNV housed mice (Supplementary Fig. 7b).
Gpr43-/- mice also showed more severe inflammation compared with
Figure 3 | Inflammatory arthritis and allergic airway disease and GPR43
deficiency. a, Inflammatory arthritis (K/BxN serum injection on day 0 and
2) in Gpr43-/- mice (n = 5) versus wild-type littermates (n = 3). Scores
shown are mean ± s.e.m. for each time point, representative of three
independent experiments. Wild-type mice are represented with closed
squares, Gpr43-/- mice with open squares, controls with dashed lines, and
arthritic mice with solid lines. Change in ankle thickness (top) and
measurement of MPO in the peripheral blood (bottom) showed that both
naive and arthritic Gpr43-/- mice had higher MPO production when
stimulated with phorbol 12-myristate 13-acetate (PMA; bottom), indicating
greater neutrophil activation (P < 0.001 Gpr43-/- control compared to
wild-type control, P = 0.0019 Gpr43-/- compared to wild-type arthritic).
Histological assessment at day 18 (right) (scale bars, 50 µm). b, OVA-induced
allergic airway inflammation. BAL fluid cell counts (left), eosinophil
peroxidase (EPO) activity in lung tissue (middle), and inflammation as
scored by histology (right). The bottom panel shows representative
haematoxylin-and-eosin-stained lung sections from wild-type and
Gpr43-/- mice, and control (no OVA) mice. Scale bar, 50 µm.
Figure 4 | GPR43 signalling and immune cell functions. a, Protein
expression analysis using Kinex antibody microarrays. Z-score-transformed
values reflecting positive or negative shifts in differential protein expression
fold-changes after acetate treatment of neutrophils from wild-type mice
compared to that from Gpr43-/- mice. Proteins highlighted in red or green
indicate those with Z-scores above +1.5 or below −1.5, respectively.
b, Apoptosis in wild-type and Gpr43-/- bone marrow cells, with or without


wild-type littermates in an acute allergic airway inflammation model, 
with increased numbers of cells in the broncho-alveolar lavage (BAL) 
fluid, as well as greater levels of eosinophil peroxidase and inflammatory
cells in the lung tissue (Fig. 3b). 

The cellular and molecular basis for GPR43-SCFA effects on
inflammatory responses was studied using Kinexus protein microarrays
that interrogate more than 600 proteins and phosphoproteins.
Because SCFAs also inhibit histone deacetylases and thereby affect cell
transcription and functions”, we made direct comparisons between
wild-type and Gpr43-/- neutrophils, to identify GPR43-related signalling
pathways affected by SCFAs. Protein-protein interaction networks
were constructed from the set of top-ranked proteins exhibiting
greatest positive or negative differential shifts (Z-scores ≥1.5 and ≤−1.5;
Fig. 4a). We applied the MCODE clustering algorithm to identify
highly connected local sub-networks, which revealed interesting modules
such as the apoptosis-associated BAX-BAK1-BCL2L1 cluster and
the PRKCA-PTPN6-LCK cluster (Supplementary Fig. 8a). Consistent
with the changes in apoptosis-related signalling molecules, acetate
induced apoptosis in neutrophils in a dose-dependent and a GPR43-dependent
manner (Fig. 4b), except at very high concentrations
(30 mM). There were also differences between wild-type and
Gpr43-/- neutrophils with respect to several other signalling pathways
associated with inflammation, cell migration or apoptosis in mouse,
and in human (Supplementary Figs 9 and 10). Granulocytes from
Gpr43-/- mouse blood showed an increase in levels of ROS (Fig. 4c)
and MPO (Fig. 3a). Interestingly, macrophages from germ-free mice
also show increased production of ROS*. Gpr43-/- neutrophils also
showed increased chemotaxis to the N-formyl peptide f-Met-Leu-Phe
(fMLP) and to the complement fragment C5a (Fig. 4d), compared to
wild-type cells. Furthermore, heat-inactivated Staphylococcus aureus
injected into the peritoneum of Gpr43-/- mice yielded a greater
recruitment of neutrophils after 1 h, compared to wild-type mice
(Fig. 4e). Acetate stimulation of human neutrophils markedly reduced
surface expression of pro-inflammatory receptors such as C5aR and
CXCR2 (Supplementary Fig. 11), presumably through agonist-mediated
receptor heterodimerization and internalisation.

Commensal bacteria and vertebrate immune systems form a sym- 
biotic relationship and have co-evolved”* such that proper immune 
development and function relies on colonisation of the gastro- 
intestinal tract by commensal bacteria*’®. SCFA-GPR43 signalling 
acetate stimulation (apoptotic cells are annexin V and propidium iodide (PI)
double positive). c, Chemotactic response to fMLP and C5a by wild-type and
Gpr43-/- bone marrow granulocytes. d, Recruitment of neutrophils and
macrophages to the peritoneum in wild-type and Gpr43-/- mice, injected
with 1 X 10° heat-inactivated S. aureus particles. e, Reactive oxygen species
production by peripheral blood granulocytes.


is one of the molecular pathways whereby commensal bacteria regu- 
late immune and inflammatory responses. GPR43 resembles another 
anti-inflammatory chemoattractant receptor, ChemR23 (ref. 27), 
although this receptor binds endogenous rather than bacterially pro- 
duced ligands. Any agents that affect gastrointestinal microbiota, and 
the production of SCFAs, might be expected to influence immune 
and inflammatory responses. It is possible that high levels of SCFAs 
such as acetate may, in addition to their direct effects on the GPR43
response, affect the biosynthesis of endogenous fatty acids, such as 
resolvins, that modulate leukocyte functions. Levels of SCFAs in the 
gastrointestinal tract vary significantly depending on the amount of 
non-digestible fibre in the diet, and also relate to the composition of 
the gut microbiota’’. For instance, the relative proportion of Bacter- 
oidetes is decreased in obese people compared to lean people”. 
Indeed an altered composition of the gut microbiota, brought on 
by western diet, or by use of antibiotics, has been suggested as a 
reason for the increased incidence of allergies and asthma in 
humans”. SCFA-GPR43 interactions could represent a central
mechanism to account for effects of diet, prebiotics and probiotics
on immune responses, and may represent new avenues for under- 
standing and potentially manipulating immune responses. 


METHODS SUMMARY 


RNA extraction, preparation, hybridization and expression analysis using
U133A and B Affymetrix GeneChips (Gene Expression Omnibus (GEO) accession
GSE3982) were performed as previously described, using an extensive collection
of immune cell data sets'*. Protein expression and phosphorylation were
assessed using Kinex antibody microarrays (Kinexus Bioinformatics)
(http://www.kinexus.ca/services). Gpr43-/- mice on a C57Bl/6 background were obtained
from Deltagen (http://www.deltagen.com). The K/BxN inflammatory arthritis
model, the DSS and TNBS models of colitis, and the OVA allergic airway inflammation
model followed standard published procedures. Statistical analyses were conducted
using a Student’s two-way t-test, or two-way analysis of variance
(ANOVA) using Graphpad Prism software.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Microarray gene expression analysis. RNA extraction, preparation, hybridiza- 
tion and expression analysis using U133A and B Affymetrix GeneChips were 
performed as previously described, using an extensive immune cell data set*’. For 
analysis of immune cell data sets, MAS5-normalized data was filtered to remove
probe sets identified as ‘absent’ (MAS5 algorithm) across all samples. The data
was further filtered by setting a minimum threshold value >20 in at least one
sample for each probe set and a maximum-minus-minimum expression value >100.
The data was then log- and Z-score-transformed. Pearson correlation coefficients
were calculated for each pair of genes, as well as between GPR43 and each gene
represented on the array, and k-means clustering was performed*'. Heatmaps were
generated using TreeView”. 
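
The filtering, transformation and correlation steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the pipeline used in the study: the function names, the log base (2), the use of the population standard deviation and the toy expression values are all hypothetical, and the 'absent'-call filter and k-means step are omitted.

```python
import math
from statistics import mean, pstdev


def filter_probesets(expr, min_peak=20.0, min_range=100.0):
    """Keep probe sets with at least one value > min_peak and (max - min) > min_range."""
    return {
        probe: values
        for probe, values in expr.items()
        if max(values) > min_peak and (max(values) - min(values)) > min_range
    }


def log_zscore(values):
    """Log2-transform a profile (values must be positive), then Z-score it."""
    logged = [math.log2(v) for v in values]
    mu, sd = mean(logged), pstdev(logged)
    return [(x - mu) / sd for x in logged]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length profiles."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def coexpression_neighbours(expr, target, top_n=150):
    """Rank the filtered genes by correlation with the target profile."""
    kept = filter_probesets(expr)
    z = {probe: log_zscore(values) for probe, values in kept.items()}
    scores = [(probe, pearson(z[target], profile))
              for probe, profile in z.items() if probe != target]
    scores.sort(key=lambda item: item[1], reverse=True)
    return scores[:top_n]


# Toy expression values across four hypothetical samples (not real data).
toy = {
    "GPR43": [30.0, 400.0, 900.0, 50.0],
    "FPR1": [25.0, 350.0, 800.0, 40.0],
    "GENE_X": [900.0, 60.0, 30.0, 700.0],
    "FLAT": [10.0, 12.0, 11.0, 9.0],  # never exceeds 20, so it is filtered out
}
neighbours = coexpression_neighbours(toy, "GPR43", top_n=2)
```

With `top_n=150` the same ranking would correspond to the co-expression neighbourhood size quoted in the Figure 2 legend.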

Kinex protein microarrays. Preparation of samples was done according to 
standard Kinexus recommendations (http://www.kinexus.ca/services). The 
normalized data from Kinexus was further filtered to remove data points with 
error ranges exceeding Kinexus’s suggested threshold of 15% for adjacent dupli- 
cate spots. Positive or negative shifts in log expression fold-changes after SCFA 
treatment of Gpr43-/- cell samples compared to that from wild-type samples
were Z-score-transformed, identifying only top-scoring proteins with Z-scores
above +1.5 or below −1.5.
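
As a rough illustration of the Z-score selection described above, the sketch below assumes that a "shift" is the difference between the log fold-change in Gpr43-/- cells and that in wild-type cells, and that Z-scores are computed across all proteins with the population standard deviation. Both choices, the function name and the protein labels are assumptions made for the example.

```python
from statistics import mean, pstdev


def zscore_hits(log_fc_wt, log_fc_ko, threshold=1.5):
    """Return proteins whose Z-scored shift lies above +threshold or below -threshold."""
    shared = sorted(set(log_fc_wt) & set(log_fc_ko))
    # Assumed definition of the shift: knockout minus wild-type log fold-change.
    shifts = [log_fc_ko[p] - log_fc_wt[p] for p in shared]
    mu, sd = mean(shifts), pstdev(shifts)
    z = {p: (s - mu) / sd for p, s in zip(shared, shifts)}
    return {p: score for p, score in z.items()
            if score > threshold or score < -threshold}


# Hypothetical log fold-changes for eight placeholder proteins (not real data).
wt = {f"PROT_{i}": 0.0 for i in range(1, 9)}
ko = dict(zip(sorted(wt), [0.1, -0.1, 0.0, 0.2, -0.2, 0.05, -0.05, 2.0]))
hits = zscore_hits(wt, ko)
```

In this toy set only the last protein clears the threshold; the other seven have Z-scores well inside the ±1.5 band.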

Human protein interaction networks. Protein—protein interaction networks 
were constructed by iteratively connecting interacting proteins using curated 
data from the Human Protein Reference Database (HPRD)**. The network uses 
graph theoretic representations which use abstract components (gene products) 
as nodes and relationships (interactions) between components as edges, imple- 
mented in the Perl programming language. 
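
The graph representation described above (gene products as nodes, interactions as edges, grown by iteratively connecting interaction partners) might look like the following. This is a Python sketch for illustration, not the Perl implementation used in the study; the function name, the `rounds` parameter and the interaction pairs are hypothetical placeholders rather than HPRD records.

```python
from collections import defaultdict


def build_network(interactions, seeds, rounds=1):
    """Grow an undirected graph from seed proteins by adding interaction partners."""
    partners = defaultdict(set)
    for a, b in interactions:
        partners[a].add(b)
        partners[b].add(a)

    nodes = set(seeds)
    edges = set()
    frontier = set(seeds)
    for _ in range(rounds):
        next_frontier = set()
        for protein in frontier:
            for partner in partners.get(protein, ()):
                # An undirected edge is stored once, whichever end found it.
                edges.add(frozenset((protein, partner)))
                if partner not in nodes:
                    nodes.add(partner)
                    next_frontier.add(partner)
        frontier = next_frontier
    return nodes, edges


# Placeholder interaction pairs, for illustration only.
toy_interactions = [
    ("BAX", "BAK1"),
    ("BAK1", "BCL2L1"),
    ("BAX", "BCL2L1"),
    ("LCK", "PTPN6"),
    ("PTPN6", "PRKCA"),
]
nodes, edges = build_network(toy_interactions, seeds={"BAX"}, rounds=1)
```

A clustering step such as MCODE would then operate on the resulting node and edge sets; it is not reproduced here.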

Neutrophil assays. Human neutrophils were isolated from the peripheral 
venous blood of healthy volunteers, using 1% dextran sedimentation for 30 
min followed by centrifugation over 65% Percoll (Amersham Bioscience). 
Mouse neutrophils were isolated from hind leg femurs and separated by density 
centrifugation over Ficoll-Paque (Amersham Bioscience). 

Reactive oxygen species were detected using H2DCFDA (Sigma), as per manufacturer’s
directions. Twenty-five microlitres of 10 µM DCFDA was incubated
with 25 µl whole blood for 20 min at 37 °C. Cells were then fixed, and red blood
cells lysed. Cells were resuspended in FACS buffer.

Phagocytosis was assessed using fluorescently labelled Bioparticles, Escherichia
coli K-12 strain or S. aureus (wood strain without protein A)-BODIPY FL conjugates
(Molecular Probes). In vitro: 25 µl of whole blood was incubated with
3 X 10° Bioparticles for 30 min, then analysed by flow cytometry. Chemotaxis
was assessed using 3 µm Multiscreen-MIC 96-well plates (Millipore). Bone marrow
neutrophils (3-5 X 10°) were in the top chamber, with chemoattractants in the
bottom chamber (fMLP, C5a, acetate, concentrations as shown in figures); cells
and chemoattractants were in Chemotaxis buffer (50:50 RPMI and M199 + 2%
BCS). Cell migration was assessed by flow cytometry as described previously**.
Apoptosis was assessed using Annexin V-FITC and propidium iodide as per 
manufacturer’s instructions (Becton Dickinson). 
Animals and models. All experimental procedures involving mice were carried 
out according to protocols approved by the relevant Animal Ethics Committees. 
Gpr43-/- mice on a C57Bl/6 background were obtained from Deltagen. Housing
and maintenance of germ-free mice followed previously described procedures**”*. 
Germ-free mice were raised and housed germ free, as described previously*’. 

Acute DSS colitis: mice were fed DSS in drinking water (percentage indicated in 
each figure) for 7 days and were monitored daily for stool score and visible bleeding. 
Stools were scored between 0 and 4, with 0 being normal up to 4 being diarrhoea, as 
described previously**. At 7 days mice were killed and colons measured and assessed 
histologically using standard procedures. For bone marrow chimaeras, Gpr43-/-
(CD45.2) and congenic C57Bl/6 wild-type mice (CD45.1) were irradiated with 9 Gy 
then reconstituted with donor bone marrow (10’ cells) for 7 weeks. Reconstitution 
was checked by flow cytometry, using CD45 to determine bone marrow origin. For 
acetate-treatment experiments mice were fed acetate in the drinking water (con- 
centration indicated in figure legends) for 5-7 days before DSS administration, and 
also in combination with DSS. MPO was detected as previously described”’. Colon 
histology scoring was performed using methods adapted from those previously described.
Damage was scored using the following system. Mucosal ulceration: 0, no injury; 
1, focal injury; 2, multifocal injury; 3, diffuse ulceration/infiltration. Depth of injury: 
0, no injury; 1, mucosal involvement only; 2, mucosal and submucosal involvement; 
3, transmural involvement. 

Chronic DSS colitis: Mice were fed DSS on days 0-6 (4%), 20-24 (2%) and 
35-39 (4%), and killed on day 42. Mice were given normal drinking water in the 
rest periods. Disease was monitored as per the acute model. 
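
For readers who want the chronic dosing schedule in machine-readable form, a minimal sketch follows. It assumes that the day ranges quoted above are inclusive and that day 42 is the last day of the protocol; the constant and function names are hypothetical.

```python
# (first day, last day, DSS percentage), taken from the schedule above.
DSS_CYCLES = [(0, 6, 4.0), (20, 24, 2.0), (35, 39, 4.0)]
END_DAY = 42  # mice were killed on day 42


def dss_percent(day):
    """Return the DSS percentage in the drinking water on a given protocol day."""
    if not 0 <= day <= END_DAY:
        raise ValueError("day is outside the 0-42 day protocol")
    for first, last, percent in DSS_CYCLES:
        if first <= day <= last:
            return percent
    return 0.0  # rest period: normal drinking water
```

For example, `dss_percent(22)` falls in the second cycle and `dss_percent(10)` falls in a rest period.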

TNBS-induced colitis: mice were sensitised by applying a mixture of acetone/ 
olive oil (50:50) with TNBS (Sigma) (50:50, total) on shaved skin between 
shoulder blades. Seven days later, mice were challenged intra-rectally with 
2.5 mg TNBS with 50% ethanol, 3.5 cm from the anal verge. Mice were fasted
overnight before the intra-rectal challenge, and given 5% dextrose in the drink- 
ing water. Mice were killed 3 days after TNBS challenge. 

For the K/BxN inflammatory arthritis model, 150 µl serum from K/BxN arthritic
mice was used to induce experimental arthritis in recipient mice, and disease
progression was monitored". The clinical score was calculated for each mouse
by summing the scores for the four paws: 0, normal joint; 1, mild-to-moderate 
swelling of the ankle and/or one swollen digit; 2, swollen ankle or swelling in two 
or more digits; 3, severe swelling along all aspects of paw or all five digits swollen. 
For SCFA-feeding, sodium acetate was dissolved in drinking water at 200 mM 
(or as indicated) and fed to mice 1 week before colitis or arthritis induction, as 
well as during disease monitoring. Myeloperoxidase bioluminescence was 
detected in peripheral blood as described previously”. 
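
The per-paw arthritis scale described above sums to a clinical score between 0 and 12 for each mouse. A minimal sketch of that arithmetic, with hypothetical names and simple input validation added for the example:

```python
# Per-paw scale, as described in the text above.
PAW_SCALE = {
    0: "normal joint",
    1: "mild-to-moderate swelling of the ankle and/or one swollen digit",
    2: "swollen ankle or swelling in two or more digits",
    3: "severe swelling along all aspects of paw or all five digits swollen",
}


def clinical_score(paw_scores):
    """Sum the scores of the four paws of one mouse (range 0-12)."""
    if len(paw_scores) != 4:
        raise ValueError("expected one score for each of the four paws")
    for score in paw_scores:
        if score not in PAW_SCALE:
            raise ValueError(f"invalid paw score: {score!r}")
    return sum(paw_scores)
```

A mouse with paws scored 0, 1, 2 and 3 would therefore receive a clinical score of 6.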

Allergic airway disease was induced using OVA/alum as previously described* 
with modification. In brief, OVA/alum was injected intraperitoneally on days 0
and 7, followed by intranasal challenge on days 12 to 15 and mice were killed on 
day 16. Eosinophil peroxidase was detected as previously described”. 
Resolvin D2 is a potent regulator of leukocytes and
controls microbial sepsis


Matthew Spite, Lucy V. Norling, Lisa Summers, Rong Yang, Dianne Cooper, Nicos A. Petasis,
Roderick J. Flower, Mauro Perretti & Charles N. Serhan


A growing body of evidence indicates that resolution of acute 
inflammation is an active process'”. Resolvins are a new family of 
lipid mediators enzymatically generated within resolution networks 
that possess unique and specific functions to orchestrate catabasis, 
the phase in which disease declines”*. Resolvin D2 (RvD2) was 
originally identified in resolving exudates, yet its individual contri- 
bution in resolution remained to be elucidated. Here, we establish 
RvD2’s potent stereoselective actions in reducing excessive neutro- 
phil trafficking to inflammatory loci. RvD2 decreased leukocyte— 
endothelial interactions in vivo by endothelial-dependent nitric 
oxide production, and by direct modulation of leukocyte adhesion 
receptor expression. In mice with microbial sepsis initiated by 
caecal ligation and puncture, RvD2 sharply decreased both local 
and systemic bacterial burden, excessive cytokine production and 
neutrophil recruitment, while increasing peritoneal mononuclear 
cells and macrophage phagocytosis. These multi-level pro-resolving 
actions of RvD2 translate to increased survival from sepsis induced 
by caecal ligation and puncture and surgery. Together, these results 
identify RvD2 as a potent endogenous regulator of excessive inflam- 
matory responses that acts via multiple cellular targets to stimulate 
resolution and preserve immune vigilance. 

Ungoverned inflammation is an underlying component of many 
pathologies, such as cardiovascular disease, diabetes and sepsis". It is
now recognized that resolution of inflammation is an active programme
controlled by temporal and spatial production of specialized chemical 
mediators”**®. Recently, autacoids endogenously generated from 
omega-3 essential fatty acids, namely resolvins, were identified during 
the resolution phase of inflammation that actively promote catabasis via 
potent pro-resolving and anti-inflammatory actions’. RvD2, bio- 
synthesized from docosahexaenoic acid (DHA), was originally iden- 
tified during resolution’. Its complete stereochemistry and actions 
remained of interest. To this end, we investigated whether RvD2 pre- 
serves host immune function to facilitate resolution of inflammatory 
sepsis. 

First, the complete stereochemistry of endogenous RvD2 was deter- 
mined by physical matching with compounds prepared by total 
organic synthesis (Supplementary Fig. 1a) from enantiomerically
and geometrically pure starting materials in accordance with the basic
structure determined in resolving exudates’ (Fig. 1). This approach
was needed because the nanogram amounts of endogenous RvD2
isolated precluded direct nuclear magnetic resonance (NMR) analysis.
The double-bond geometry of synthetic material was validated by 1H
NMR (Supplementary Fig. 1b). The biosynthesis of RvD2 involves
17-lipoxygenation of DHA to 17S-hydroperoxy-4Z, 7Z, 10Z, 13Z,
15E, 19Z-docosahexaenoic acid (17-HpDHA), which is enzymatically
transformed to a 7(8)epoxide-containing intermediate*’ in human
leukocytes. This enzymatic activity involves 5-lipoxygenase (LOX)
and its epoxide-generating activity*. These steps can occur within a 
single cell type or via transcellular biosynthesis. For example, eosi- 
nophils, rich in 15-LOX, can convert DHA to 17-HpDHA, which 
polymorphonuclear neutrophils (PMNs) can convert to RvD2 
(Fig. 1a). Actively phagocytosing PMNs converted resolvin precursor 
17-HpDHA to RvD2 as determined by lipidomics based on liquid-chromatography
tandem mass spectrometry (LC-MS/MS). A total
ion chromatogram (mass-to-charge ratio m/z = 375 [M-H]) of
human leukocyte-derived RvD2 is shown (Fig. 1b), with characteristic
conjugated tetraene ultraviolet (UV)-chromophore (absorbance λmax
at 301 nm with shoulders at 289 nm and 315 nm). Synthetic material
showed an exclusive and prominent peak with retention time and UV
spectrum essentially identical to leukocyte-derived RvD2 (Fig. 1c).
Co-injection of synthetic and leukocyte-derived RvD2 led to an 
increase in intensity and co-elution (Fig. 1d). To further establish 
the physical properties, their tandem mass spectra were analysed, with 
essentially identical mass spectrum, and diagnostic ions in agreement 
with original assignments for endogenous RvD2 (Fig. 1e, f)*. To
validate the biosynthetic pathway, activated human PMNs were
incubated with deuterium-labelled 17S-HpDHA-d5 or DHA-d5;
RvD2 containing the d5 label was biosynthesized (Fig. 1g), with the parent
ion [M-H] increased to m/z 380 and neutral-loss ions reflecting
d5-containing fragments (Supplementary Fig. 2). Next, to further
confirm the structural assignment*”, derivatized RvD2 was subjected
to gas chromatography-mass spectrometry (GC/MS). The derivatized
RvD2 C-value was 25.2 ± 0.1 and its spectrum (Supplementary Fig. 3)
showed diagnostic ions at m/z 479, 435, 229 and 171 (see ref. 3). 
Collectively, matching of synthetic and leukocyte-derived RvD2 (by 
retention time, mass spectral diagnostic ions and derivatization) 
established the complete stereochemistry and double-bond geometry 
of endogenous RvD2 as 7S, 16R, 17S-trihydroxy-4Z, 8E, 10Z, 12E, 
14E, 19Z-docosahexaenoic acid. 
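
The ion assignments quoted in this section and in the Figure 1 legend follow from simple nominal-mass arithmetic, which the sketch below reproduces. It assumes integer (nominal) masses of 18 for water and 44 for carbon dioxide, and a shift of one mass unit for each hydrogen replaced by deuterium; the function names are hypothetical.

```python
H2O = 18      # nominal mass of water
CO2 = 44      # nominal mass of carbon dioxide
PARENT = 375  # [M-H] ion of RvD2, as quoted in the text


def neutral_loss(mz, n_h2o=0, n_co2=0):
    """Nominal m/z after losing n_h2o waters and n_co2 carbon dioxides."""
    return mz - n_h2o * H2O - n_co2 * CO2


def deuterium_labelled(mz, n_deuterium):
    """Nominal m/z after replacing n_deuterium hydrogens with deuterium."""
    return mz + n_deuterium


fragments = {
    "[M-H-H2O]": neutral_loss(PARENT, n_h2o=1),
    "[M-H-2H2O]": neutral_loss(PARENT, n_h2o=2),
    "[M-H-CO2-H2O]": neutral_loss(PARENT, n_h2o=1, n_co2=1),
    "[M-H-CO2-2H2O]": neutral_loss(PARENT, n_h2o=2, n_co2=1),
    "[277-H2O]": neutral_loss(277, n_h2o=1),
    "[277-CO2]": neutral_loss(277, n_co2=1),
}
d5_parent = deuterium_labelled(PARENT, 5)
```

The values computed this way agree with the m/z 357, 339, 313, 295, 259 and 233 ions listed in the Figure 1 legend, and with the m/z 380 parent ion reported for the d5-labelled product.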

RvD2 displayed potent actions in microbial peritonitis, with a drastic 
~70% reduction in zymosan-stimulated PMN infiltration at doses as 
low as 10 pg (Fig. 2). Importantly, the Δ10-trans-RvD2 isomer was
essentially inactive, indicating that specific geometry of endogenous 
RvD2 is required for bioactivity. To determine whether RvD2 decreases 
leukocyte-endothelial interactions, microcirculation in the cremaster 
muscle was analysed. Platelet activating factor (PAF; 100 nM)*"® super- 
fusion caused increased leukocyte adherence and emigration that was 
markedly reduced by 1 nM RvD2 (Fig. 2c, d). Representative microcirculation
before and after RvD2 superfusion is shown in the
Supplementary movies. These potent RvD2 actions were recapitulated 
with human cells to identify potential cellular directed actions (that is, 
endothelial cells and PMN). RvD2 potently reduced PAF-stimulated 
Figure 1 | Stereochemical assignment, biosynthesis and total organic
synthesis of RvD2. a, Rv biosynthesis illustrating potential PMN and 
eosinophil transcellular biosynthesis. Blue, stereochemistries for 
assignments. b, Ion chromatograms (m/z 375) depicting human leukocyte- 
derived RvD2 and RvD1 with UV spectra (insets) and related isomers with 
the @-2 OH metabolites of PD1 and 7,17-diHDHA. c, d, RvD2 prepared by 


capture and adhesion of PMN by human umbilical vein endothelial 
cells (HUVECs) under flow (Supplementary Fig. 4)''. We note that 
RvD2 also reduced complement-mediated (with complement C5a) 
PMN-endothelial interactions (Supplementary Fig. 5a–c), a key mediator 
in sepsis'*. Consistent with the impact of RvD2 on leukocyte–endothelial 
interactions, RvD2 diminished PAF-stimulated CD62L (L-selectin) shedding 
on isolated human PMN, and CD18 (ITGB2, also known as integrin beta 2) 
surface expression (Fig. 2e, f). RvD2 alone 
did not alter PMN adhesion molecule expression (n = 3, not shown). 
To obtain further evidence for direct actions of RvD2 on human leukocytes, 
we monitored reactive oxygen species (ROS). Importantly, 
RvD2 did not stimulate extracellular superoxide, and it potently 
reduced C5a-stimulated extracellular superoxide generation (see below 
and Supplementary Fig. 5e, f). 

Next, we assessed the contribution of nitric oxide, an established anti- 
adhesive mediator'*" in RvD2-reduced leukocyte adherence in post- 
capillary venules. The non-selective nitric oxide synthase inhibitor 


total organic synthesis (c) and co-injection with leukocyte-derived RvD2 
(d). e, f, MS/MS of leukocyte-derived RvD2 (e) and synthetic RvD2 (f) with 
prominent ions at m/z 357 [M-H-H2O], 339 [M-H-2H2O], 313 [M-H-CO2-H2O], 
295 [M-H-CO2-2H2O], 277, 259 [277-H2O], 247, 233 [277-CO2], 141 
and 113. g, Leukocyte-derived RvD2-d5. Blue, d5-labelled ions. 
Representative of n = 3-5. 


L-NAME, before addition of RvD2, partially reversed the decreased 
leukocyte adherence and emigration (Fig. 3a, b). To obtain additional 
evidence for nitric oxide generation by RvD2 in vivo, vascular fluor- 
escence was monitored (see Methods). Topical administration of RvD2 
(100 pg per ear) increased fluorescence intensity, whereas lower doses 
(1 pg and 10 pg) were ineffective (Supplementary Fig. 6a). L-NAME 
given before topical RvD2 application abolished this response (Sup- 
plementary Fig. 6b), indicating that RvD2-stimulated vascular responses 
at this dose were nitric-oxide-dependent. We note that intravenous 
injection of RvD2 at doses that inhibited PMN infiltration in peritonitis 
(10 pg) did not increase fluorescence intensity, indicating that only local 
elevated doses of RvD2 stimulated vascular responses (not shown). 
Additionally, RvD2 superfusion (1 nM) did not cause an increase in 
vascular permeability (Supplementary Fig. 6e). Topical RvD2 (10 pg 
or 100 pg) did not induce leukocyte infiltration into ear skin compared 
to chemoattractant leukotriene B4 (Supplementary Fig. 6f). These 
results demonstrate that high focal delivery of RvD2 stimulates rapid 
Figure 2 | RvD2 potently reduces leukocyte-endothelial interactions to 
reduce microbial peritonitis. a, b, Leukocyte infiltration in peritonitis. 
*P < 0.05, **P < 0.01 ANOVA. b, Equidose comparison (100 pg) of RvD2 and 
Δ10-trans-RvD2. c, d, Leukocyte trafficking in vivo. PAF-stimulated (100 nM) 
leukocyte adherence (c) and emigration (d) with or without RvD2 or RvD1. 
*P < 0.01 (PAF versus PAF + RvD2 1 nM), #P < 0.05 (PAF versus 
PAF + RvD1 30 nM) and †P < 0.05 (PAF versus PAF + RvD2 30 nM) ANOVA. 
e, f, Adhesion receptor surface expression. Results (n = 3-6) are mean ± s.e.m. 


nitric oxide production consistent with its anti-adhesive effects but not 
to a level that is pro-inflammatory. 

Corroboratory results were obtained with HUVECs whereby RvD2 
dose-dependently stimulated nitric oxide generation (Fig. 3c), suggest- 
ing that topical actions were probably mediated via endothelial nitric 
oxide synthase (eNOS). To test this, peritonitis was evaluated in mice 
deficient in eNOS. In agreement with an earlier report’*, no changes 
were observed with respect to leukocyte infiltration between wild-type 
and eNOS−/− (also known as Nos3−/−) mice. RvD2 reduction in leukocytes 
was eliminated in eNOS−/− mice (Fig. 3d), an effect that has also 
been reported for aspirin and local aspirin-triggered lipoxins’*®. Notably, 
RvD2 also stimulated vasoprotective prostacyclin (6-keto-PGF1α; 
Supplementary Fig. 7a); this dose-response curve proved to be bell- 
shaped like other lipid mediators'*®. RvD2-stimulated prostacyclin 
and nitric oxide were sensitive to pertussis toxin, implicating a role 
for G-protein coupled receptor(s) (Supplementary Fig. 7b, c). Thus, 
RvD2 regulates leukocyte adherence via both direct actions on PMN 
(see above) and endothelial vasoactive substances. 

Next, anti-inflammatory and pro-resolving actions of RvD2 were 
evaluated in caecal ligation and puncture (CLP), an established murine 
Figure 3 | Modulation of leukocyte trafficking by RvD2 is nitric oxide 
dependent. a, b, Leukocyte trafficking in vivo. Cremasters were superfused 
with PAF (100 nM), L-NAME (100 µM) and RvD2 (1 nM) and leukocyte 
adhesion (a) and emigration (b) were quantified (n = 3-5). *P < 0.05 two-way 
ANOVA. c, Nitric oxide (NOx, nitrate or nitrite) generation in primary 
HUVECs incubated with RvD2 (n = 4-6). ***P < 0.001, one-way ANOVA. 
d, Zymosan-stimulated PMN infiltration after administration of RvD2 
(100 ng; intravenously) in wild-type and eNOS−/− mice (n = 3-5). Results 
are mean ± s.e.m. *P < 0.05, two-tailed unpaired Student’s t-test. 


microbial sepsis that closely resembles human pathology’”"*. Omega-3 
fatty acids are beneficial in some inflammatory conditions, including 
sepsis'’*', although the mechanistic basis underlying protection is 
still emerging. Indeed, RvD2 significantly reduced the amount of 
live aerobic bacteria in both blood and peritoneum at 12 h post-CLP, 
whereas Δ10-trans-RvD2 was essentially not active (Fig. 4a, b). 
This was associated with a significant reduction in total leukocytes and 
specifically, PMN infiltration into the peritoneum (Fig. 4c, d). 
Interestingly, the ratio of mononuclear cells to PMN was increased 
with RvD2 (Fig. 4d, inset), similar to results obtained with sterile 
zymosan-stimulated peritonitis (Supplementary Fig. 9a). Intra- 
peritoneal delivery of RvD2 at 1h post-CLP also reduced both blood 
and peritoneal bacteria (Supplementary Fig. 8). Macrophages have an 
important role in the clearance of bacteria, cellular debris and apop- 
totic PMNs to facilitate inflammation resolution'*. RvD2 treatment 
promoted phagocyte-dependent bacterial clearance observed in 
inguinal lymph nodes (Supplementary Fig. 10d). Evidence for direct 
macrophage actions was obtained in vitro, where RvD2 potently 
enhanced macrophage phagocytosis of opsonized-zymosan (Sup- 
plementary Fig. 9b). 

To obtain additional evidence for this pro-resolution role of RvD2, 
cytokines were monitored during CLP both locally (peritoneum) 
and systemically (plasma). RvD2 drastically reduced levels of 
pro-inflammatory cytokines associated with poor outcomes in 
sepsis'*, namely the interleukins IL-6, IL-1, IL-23 and tumour necrosis 
factor TNF-α (Fig. 4e and Supplementary Fig. 10a). RvD2 
reduced cytokine levels while enhancing bacterial clearance, a res- 
ponse also observed for macrophage scavenger receptor A”. Hence, it 
Figure 4 | RvD2 reduces bacterial levels and systemic inflammation and 
enhances survival in microbial sepsis. a, b, Aerobic bacteria levels in blood 
(a) and peritoneal exudates (b) from sham or CLP-operated mice ± RvD2 
methyl ester (RvD2-Me) (100 ng; intravenously) at 12 h (n = 5-6 per group). 
ND, not detected; NS, not significant. c, d, Peritoneal leukocyte differentials, 
ratio of mononuclear to polymorphonuclear cells inset (n = 3-5). e, Plasma 


is plausible that RvD2 prevents persistent amplification signals 
downstream of pattern-recognition receptors, dampening the res- 
ponses of classically activated macrophages’?*. 

RvD2 also drastically decreased IL-17, as well as IL-10, which is of 
interest because of its detrimental impact on survival in sepsis”. Thus, 
RvD2 differs in action from lipoxin A4, which stimulates IL-10 
(ref. 26). High levels of both pro- and anti-inflammatory cytokines, 
including IL-10, are predictive of early mortality in sepsis, and dimi- 
nishing IL-10 levels proved beneficial in sepsis*’*. Pro-inflammatory 
mediators, including prostaglandin E2 (PGE2) and LTB4, were also 
decreased in peritoneum by RvD2 (Fig. 4f, g and Supplementary 
Fig. 10b, c). Interestingly, in addition to macrophage-directed actions, 
RvD2 directly enhanced PMN Escherichia coli phagocytosis that was 
accompanied by an increase in intracellular ROS (Fig. 4h, i). RvD2 
does not possess direct antibacterial activity compared to ampicillin 
(Supplementary Fig. 11). RvD2-treated CLP mice also showed pro- 
tection at 12h post-CLP from hypothermia (Fig. 4j). Accordingly, 
RvD2 dramatically increased survival rates among CLP-operated mice 
(Fig. 4k) and activity levels 12 h post-CLP were resumed (Supplemen- 
tary movie 3). 
cytokine levels (n = 3-6). f, g, Peritoneal PGE2 and LTB4 levels. 
h, i, Phagocytosis of E. coli and intracellular ROS in human PMN. 
j, Temperatures of mice 12 h post-CLP. k, Kaplan–Meier survival analysis of 
vehicle- (n = 13) and RvD2-treated (n = 12) CLP mice. Results are 
mean ± s.e.m. a-g, *P < 0.05 two-tailed unpaired Student’s t-test. 
h, j, *P < 0.05 by one-way ANOVA. k, *P < 0.05 one-tailed log-rank test. 


The present results establish the complete stereochemistry of 
endogenous RvD2 and its potent stereoselective actions facilitating 
resolution. Local and systemic bacterial burden in microbial sepsis 
were controlled and significantly dampened with RvD2. This potent 
D-series resolvin protected CLP-mice from excessive leukocyte infil- 
tration and overzealous cytokine production, and also enhanced 
clearance of microbes, thus preventing sepsis-induced lethality. 
Sepsis remains a clinical challenge, with high mortality rates and 
increasing prevalence’. Given the uncontrolled inflammatory 
pathogenesis of sepsis, anti-inflammatory therapies are used for 
sepsis management in humans, but have ultimately failed owing 
primarily to sustained immunosuppression®. Overall, these results 
indicate that RvD2 is a potent endogenous mediator which actively 
promotes the resolution of inflammation, suggesting new thera- 
peutic approaches that do not compromise host defence. 


METHODS SUMMARY 

Intra-vital microscopy. Experiments were approved by and performed under 
guidelines of the Ethical Committee for Use of Animals, Barts and The London 
School of Medicine and Home Office regulations (Guidance on the Operation of 


©2009 Macmillan Publishers Limited. All rights reserved 


NATURE|Vol 461|29 October 2009 


Animals, Scientific Procedures Act, 1986). Intra-vital microscopy was used to 
observe actions of RvD2 on leukocyte responses stimulated by PAF (100 nM C16 
form C26H54NO7P; Sigma; PAF administered 1 h before RvD2 was given at time 0) 
within the cremasteric microcirculation of C57BL/6 mice. Mice were anaesthetized 
with xylazine (7.5 mg kg⁻¹) and ketamine (150 mg kg⁻¹), and the cremaster was 
prepared”. In some experiments, fluorescein isothiocyanate (FITC)-albumin 
(1 mg) was administered intravenously to assess vascular leakage. 

Caecal ligation and puncture. CLP was performed in male FVB mice’’, in 
accordance with the Harvard Medical Area Standing Committee On Animals 
Protocol #02570. The caecum was ligated below the ileocaecal valve for mid- 
grade sepsis'’. A through and through puncture was performed with a 20-gauge 
needle, followed by one additional puncture in the distal tip of the caecum. Mice 
received saline (500 µl subcutaneously) followed by intravenous administration 
of vehicle (0.1% ethanol), or RvD2 methyl ester (100 ng) at the time of puncture. 
In some experiments, RvD2-Me (1 µg) was administered intraperitoneally 1 h 
post-CLP. At 12h, rectal temperature was measured, blood collected by cardiac 
puncture and peritoneal exudates obtained. Blood and peritoneal bacteria levels 
were determined by growth on tryptic soy agar plates. Plasma and peritoneal 
cytokine levels were determined by Searchlight array. Peritoneal cells were dif- 
ferentiated using Wright-Giemsa staining. 

Statistics. Data are mean ± s.e.m. Multiple group comparisons were made using 
one-way or two-way analysis of variance (ANOVA) followed by Dunnett’s or 
Bonferroni post tests where appropriate and direct comparisons made using a 
two-tailed unpaired Student’s t-test. Kaplan–Meier survival curves were analysed 
using a one-tailed log-rank test. In all cases, a P value < 0.05 was considered 
significant. 
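
As an illustration of two of the summary quantities named above, the following sketch computes a mean with its s.e.m. and a Kaplan–Meier survival estimate. It is not the analysis software used for the study, and it does not implement ANOVA, post tests or the log-rank test. The function names and the example numbers are hypothetical.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for a list of numbers."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def kaplan_meier(times, events):
    """Kaplan-Meier estimate of the survival function.

    times:  observation time for each animal
    events: True if death was observed at that time, False if censored
    Returns (time, survival probability) pairs at each time with a death.
    """
    curve = []
    survival = 1.0
    for t in sorted(set(times)):
        at_risk = sum(1 for x in times if x >= t)
        deaths = sum(1 for x, e in zip(times, events) if x == t and e)
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
    return curve


# Hypothetical example data, not taken from the study.
print(mean_sem([1.0, 2.0, 3.0]))
print(kaplan_meier([1, 2, 3, 4], [True, False, True, True]))
```

Comparing two survival curves, as in Fig. 4k, would additionally require a log-rank test, which is usually taken from an established statistics package rather than written by hand.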


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

RvD2 total organic synthesis. Total organic synthesis of RvD2 will be reported 
elsewhere. Briefly, stereochemically pure RvD2 methyl ester was prepared using 
chiral starting materials having the same stereochemistry of the three OH groups 
and using stereochemically controlled processes (Supplementary Fig. 1a). Thus, 
the synthesis started with the nucleophilic opening of enantiomerically pure 
protected glycidol 1 with a protected derivative of pentynoic acid 2, followed 
by functional group manipulations to form intermediate 3. Synthesis of the 
intermediate that contains the 16,17-diol moiety was prepared starting with a 
protected form of 2-deoxyribose 4, which was converted to intermediate 5 using 
a Wittig reaction that selectively formed a C=C bond with a Z-geometry. 
Conversion of 5 to alkyne intermediate 6, followed by palladium-mediated 
Sonogashira coupling of 6 with iodide 3 gave compound 7, the acetylenic pre- 
cursor of RvD2. Removal of the protective groups and selective hydrogenation to 
convert the alkyne moiety to a C=C bond with a Z-geometry gave the RvD2 
methyl ester (RvD2-Me). This compound was purified by high-performance 
liquid chromatography (HPLC) and fully characterized with NMR spectroscopy 
and mass spectrometry (Supplementary Fig. 1 and Fig. 1c and f). This is a 
different synthetic strategy for RvD2 than reported in ref. 9. The Δ10-trans-RvD2 
was prepared by isomerization at room temperature (21 °C) and exposure 
to light overnight, and was isolated using reverse phase (RP)-HPLC before 
bioassay. 

RvD2 biosynthesis. Human PMNs were isolated from healthy volunteers 
(BWH protocol # 88-02462), suspended in DPBS+/+ (25 × 10⁶ cells ml⁻¹) and 
incubated with 5 µg 17S-hydroperoxy-4Z, 7Z, 10Z, 13Z, 15E, 19Z-docosahexaenoic 
acid (17-HpDHA) and/or 17-HDHA and stimulated with zymosan (100 µg) or 
A23187 (5 µM) for 30 min at 37 °C (ref. 3). Incubations were stopped with cold 
methanol and samples were solid-phase extracted*'. 

Mediator lipidomics. LC/UV/MS/MS-based mediator lipidomic analysis was 
performed with an HPLC (Shimadzu LC20AD) connected inline with a UV 
diode array detector (Agilent G1315B), coupled to a hybrid quadrupole time- 
of-flight mass spectrometer (QStar XL; Applied Biosystems) equipped with a 
Phenomenex Luna C18(2) column (2 mm × 150 mm × 3 µm). Acquisition was 
carried out in negative ionization mode. The mobile phase consisted of 
methanol/water/acetic acid (60:40:0.01, v/v/v) and was ramped to 85:15:0.01 over 
30 min and to 100:0:0.01 over the next 5 min at a flow rate of 200 µl min⁻¹. 
The flow rate was increased to 400 µl min⁻¹ over the next 10 min. The operating 
parameters were: gas1 35.00, gas2 55.00, curtain gas 40, electrospray voltage 
−4,200 V, evaporation temperature 400 °C, declustering potential −35 V, and 
collision energies were optimized individually. GC/MS analysis was carried out with 
an HP 6890 gas chromatography system equipped with an HP-5MS capillary 
column (0.25 mm inner diameter × 30 m, Agilent) and an HP5973 mass selective 
detector (Hewlett-Packard)*’. 
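
The mobile-phase programme described above can be written as a piecewise-linear schedule. The sketch below is a reading of the text, not instrument method code. It assumes that each ramp is linear, that methanol is held at 100% during the final 10 min, and that the flow rates are 200 and 400 µl min⁻¹. The names `interpolate`, `METHANOL_PERCENT` and `FLOW_UL_PER_MIN` are hypothetical.

```python
def interpolate(points, t):
    """Linearly interpolate through (time, value) breakpoints.

    Values are clamped to the first/last breakpoint outside the covered range.
    """
    if t <= points[0][0]:
        return points[0][1]
    for (t0, v0), (t1, v1) in zip(points, points[1:]):
        if t <= t1:
            return v0 + (v1 - v0) * (t - t0) / (t1 - t0)
    return points[-1][1]


# (time in min, methanol % v/v); water makes up the balance.
METHANOL_PERCENT = [(0, 60.0), (30, 85.0), (35, 100.0), (45, 100.0)]

# (time in min, flow rate in microlitres per min); ramp shape is an assumption.
FLOW_UL_PER_MIN = [(0, 200.0), (35, 200.0), (45, 400.0)]

for minute in (0, 15, 30, 32.5, 35, 40, 45):
    print(minute,
          interpolate(METHANOL_PERCENT, minute),
          interpolate(FLOW_UL_PER_MIN, minute))
```

Under these assumptions the run lasts 45 min in total, with the flow-rate increase confined to the last 10 min.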

Targeted LC-MS/MS-based lipidomics of CLP exudates. Exudate lavages 
(2 ml) were collected and immediately added to 2 volumes of cold methanol 
for extraction and work-up as in ref. 31. LC-MS/MS identification was acquired 
with an Agilent 1100 series HPLC paired with an ABI Sciex Instruments 3200 Q 
TRAP linear ion trap quadrupole mass spectrometer. The column (Agilent 
Eclipse Plus C18, 4.6 mm × 50 mm × 1.8 µm) was eluted at a flow rate of 
0.4 ml min⁻¹ with methanol/water/acetic acid (60/40/0.01; v/v/v) ramped to 
80/20/0.01 (v/v/v) after 5 min, 95/5/0.01 (v/v/v) after 8 min, and 100/0/0.01 after 
14 min to wash the column. Instrument control and data acquisition were 
performed using Analyst 1.4.2 software. Ion pair transitions from previously 
reported multiple reaction monitoring methods were used for profiling and 
quantitation of PGE2 (351.2/189.2) and LTB4 (335.2/195.2). Criteria for 
identification were: liquid chromatography retention time and a minimum of six 
fragment diagnostic ions on the MS/MS spectrum matching those of synthetic 
standards. Deuterated 5S-HETE (Cayman Chemicals, 2 ng) was added before 
extraction as internal standard for recovery calculations. 
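
The two identification criteria stated above (matching retention time and at least six matching diagnostic fragment ions) can be expressed as a simple check. The sketch below is illustrative only. The retention-time tolerance, the comparison of ions as nominal integers, the function name and the example values are all assumptions and are not taken from the study.

```python
def meets_identification_criteria(sample_rt, standard_rt,
                                  sample_ions, standard_ions,
                                  rt_tolerance_min=0.1,
                                  min_matching_ions=6):
    """Return True if a sample peak matches a synthetic standard.

    sample_rt, standard_rt: retention times in minutes
    sample_ions, standard_ions: nominal m/z values of MS/MS fragment ions
    rt_tolerance_min: allowed retention-time difference (assumed value)
    min_matching_ions: minimum number of shared diagnostic ions
    """
    rt_matches = abs(sample_rt - standard_rt) <= rt_tolerance_min
    shared_ions = set(sample_ions) & set(standard_ions)
    return rt_matches and len(shared_ions) >= min_matching_ions


# Hypothetical values for illustration.
standard = [101, 115, 141, 189, 233, 271, 315, 333]
sample_good = [101, 115, 141, 189, 233, 271, 400]
sample_poor = [101, 115, 141, 400, 401, 402]

print(meets_identification_criteria(6.52, 6.50, sample_good, standard))
print(meets_identification_criteria(6.52, 6.50, sample_poor, standard))
print(meets_identification_criteria(7.40, 6.50, sample_good, standard))
```

In practice the tolerance would be set from the observed run-to-run variation of the synthetic standard on the same column.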

Murine peritonitis and macrophage phagocytosis. Peritonitis was assessed 
using male FVB mice (Charles River) or eNOS−/− mice and their wild-type 
littermates (C57BL/6J; Jackson Labs). Vehicle (1.0% EtOH) or RvD2 (0.001–100 ng 
per mouse) was administered intravenously followed by intraperitoneal 
administration of zymosan A (1 mg; Sigma). Peritoneal lavages were collected 
after 4h and leukocyte infiltration was assessed by light microscopy, followed by 
differential analysis using Wright-Giemsa staining. For macrophage phagocytosis, 
resident peritoneal macrophages were harvested from naive mice and incubated 
with RvD2 (0.01-100 nM) for 15 min at 37 °C before the addition of 
opsonized FITC-labelled zymosan. After 30 min, extracellular fluorescence was 
quenched by trypan blue and fluorescence was determined using a Victor3 plate 
reader (PerkinElmer). 

Direct actions of RvD2 on human endothelial cells and PMN. Primary 
HUVECs (Lonza) were incubated with or without pertussis toxin (100 ng ml⁻¹, 
12 h) with vehicle alone (0.1% EtOH) or RvD2 (0.01-100.0 nM) for 30 min at 
37 °C and supernatants were analysed for NOx (nitrate or nitrite) by the Griess 
reaction (Biomol) and 6-keto-PGF1α by ELISA (Neogen). Human PMN were 
isolated from healthy donors by Ficoll gradient and incubated with PAF (1 nM) 
with or without RvD2 (0.01-10 nM) for 15 min at 37 °C and the surface expression 
of CD62L and CD18 was determined by flow cytometry using phycoerythrin-conjugated 
anti-CD18 (6.7) and FITC-conjugated anti-CD62L (Dreg 56; BD 
Biosciences) or appropriate isotype control (IgG1κ). To assess leukocyte–endothelial 
interactions, the flow chamber assay was used'’; human PMN were 
stimulated before flow with PAF (1 nM) or C5a (0.1 nM) for 15 min. 

Extracellular superoxide assay. PMN were incubated with 5 µM lucigenin in a 
thermostatted (37 °C) luminometer (Wallac VICTOR2 1420 Multilabel 
Counter, PerkinElmer). After 15 min, PMN were treated with vehicle, C5a 
(10 nM) or RvD2 (1-10 nM) and monitored for 20 min. Alternatively, PMN 
were incubated with RvD2 (1-10 nM) for 15 min before vehicle or C5a (10 nM). 

Assessment of in vivo vascular responses by fluorescence imaging. Male FVB 
mice (6-8 weeks) were anaesthetized with pentobarbital (50 mg kg⁻¹, intraperitoneally) 
and injected intravenously with FITC-albumin (2.5 mg) alone or plus 
L-NAME (24 mg kg⁻¹). Acetone or RvD2-methyl ester (20 µl at indicated doses) 
was applied to the inner ear. Fluorescence was assessed using the Night Owl LB 
981 NC 320 Molecular Light Imager (Berthold Technologies) and images were 
acquired for 1,500 ms (every 5 min) using a HQ 475 excitation filter (Chroma) 
and Winlight Software. The mean change in fluorescence intensity was normal- 
ized to acetone control. In some experiments, RvD2 (10 pg; intravenously) was 
administered in conjunction with FITC-albumin. 

In separate experiments, LTB4 (1.0 µg; Cayman), RvD2-Me (0.1 or 0.01 ng) or 
acetone was applied topically to the inner ear. After 24 h, 6 mm 
punch biopsies were obtained, flash frozen in liquid N2, stored at −80 °C and 
myeloperoxidase activity was assessed as in ref. 32. 

E. coli phagocytosis and generation of intracellular ROS. Adherent human 
PMN were pre-incubated with RvD2 (0.1-100 nM) or vehicle for 15 min before 
incubation with E. coli (JM109; 50:1 ratio) at 37 °C for 60 min. PMN were rinsed, 
fixed and permeabilized (BD Cytofix/Cytoperm; BD Biosciences), and E. coli 
levels were assessed using FITC-conjugated anti-E. coli antibody (GTX40856; 
GeneTex) by flow cytometry. To assess intracellular ROS generation, PMN were 
incubated with 5 µM carboxy-H2DCFDA (C400; Invitrogen) for 30 min before 
incubation with RvD2 and E. coli, and probe oxidation was determined using a 
Victor3 plate reader (Perkin Elmer). 

Antibacterial susceptibility test. Mid-logarithmic phase E. coli (5 X 10’ colony- 
forming units) was inoculated on LB agar plates, discs containing RvD2 (0.1-100 ng) 
or ampicillin (10 µg) were placed on top, and the zone of clearing was 
assessed after overnight incubation. 

Immunohistochemistry. Inguinal lymph nodes and spleens were excised 12h 
after CLP and fixed in 3% formalin. Tissues were paraffin-embedded, sectioned 
and slides were stained with haematoxylin and eosin and Gram by the 
Department of Pathology at the Children’s Hospital Boston. 


31. Serhan, C.N., Lu, Y., Hong, S. & Yang, R. Mediator lipidomics: search algorithms 
Epigenetic reversion of post-implantation epiblast to 
pluripotent embryonic stem cells 


Siqin Bao¹*, Fuchou Tang¹*, Xihe Li², Katsuhiko Hayashi¹†, Astrid Gillich¹, Kaiqin Lao³ & M. Azim Surani¹ 


The pluripotent state, which is first established in the primitive 
ectoderm cells of blastocysts, is lost progressively and irreversibly 
during subsequent development’. For example, development of 
post-implantation epiblast cells from primitive ectoderm involves 
significant transcriptional and epigenetic changes, including 
DNA methylation and X chromosome inactivation’, which create 
a robust epigenetic barrier and prevent their reversion to a 
primitive-ectoderm-like state. Epiblast cells are refractory to leuk- 
aemia inhibitory factor (LIF)-STAT3 signalling, but they respond 
to activin/basic fibroblast growth factor to form self-renewing epiblast 
stem cells (EpiSCs), which exhibit essential properties of epiblast 
cells** and differ from embryonic stem (ES) cells derived 
from primitive ectoderm’. Here we show reprogramming of 
advanced epiblast cells from embryonic day 5.5–7.5 mouse embryos 
with uniform expression of N-cadherin and inactive X chro- 
mosome to ES-cell-like cells (rESCs) in response to LIF-STAT3 
signalling. Cultured epiblast cells overcome the epigenetic barrier 
progressively as they proceed with the erasure of key properties of 
epiblast cells, resulting in DNA demethylation, X reactivation and 
expression of E-cadherin. The accompanying changes in the tran- 
scriptome result in a loss of phenotypic and epigenetic memory of 
epiblast cells. Using this approach, we report reversion of estab- 
lished EpiSCs to rESCs. Moreover, unlike epiblast and EpiSCs, 
rESCs contribute to somatic tissues and germ cells in chimaeras. 
Further studies may reveal how signalling-induced epigenetic 
reprogramming may promote reacquisition of pluripotency. 

Previous studies showed that epiblast cells, unlike primitive ecto- 
derm cells, are refractory to LIF-STAT3 signalling; instead they 
respond to activin/basic fibroblast growth factor (bFGF) to generate 
EpiSCs**, which are more like epiblast with an inactive X chro- 
mosome, and differ from ES cells. Here we re-examined advanced 
post-implantation epiblast cells to see if they could revert to ES-cell- 
like cells in response to LIF-STAT3 signalling. 

First, epiblast tissue carrying an Oct4-ΔPE-green fluorescent 
protein (GFP) reporter® was isolated from mouse embryos on embryonic 
days (E) E5.5–E7.5. This reporter, with only the distal enhancer 
for Oct4 (also known as Pou5f1), shows preferential expression in the 
primitive ectoderm, primordial germ cells (PGCs) and ES cells, but 
not in the epiblast or EpiSCs°. Notably, the distal enhancer constitutes 
an ‘enhanceosome’ representing the densest binding locus for the key 
pluripotency-specific transcripts in ES cells’, which makes it likely that 
its activation will only occur if all pluripotency factors are expressed 
optimally; some of these must be lacking or suboptimal in the epiblast 
and EpiSCs. 

Next, the epiblast tissue was dissected to remove the most prox- 
imal region (the site of existing PGCs and PGC precursors”) and the 
outer visceral endoderm (Fig. 1a). Notably, unlike previous studies 
where the epiblast tissue was left intact**, we trypsin-digested individual 
epiblast into a single-cell suspension to break up existing cell–cell 
interactions and promote establishment of a new signalling- 
induced transcriptional network in vitro. The resulting cells were 
cultured in LIF—fetal calf serum (FCS) on mouse embryonic fibro- 
blast (MEF) feeder cells, a standard condition used for the derivation 
of ES cells from primitive ectoderm, and of induced pluripotent stem 
(iPS) cells from somatic cells**’. After 4-7 days, most cultures 
revealed large colonies (Fig. 1a and Supplementary Table 1) with 
many alkaline-phosphatase-positive cells (Fig. 1a), but no detectable 
Oct4-ΔPE-GFP expression, indicating that the distal enhancer was 
still inactive (Fig. 1a). We could propagate the cultured epiblast (cEpi) 
colonies in LIF—FCS after collagenase treatment without detectable 
changes for at least 20 passages. 

After culture of cEpi cells for 14-35 days, we started to detect 
clusters of GFP-positive cells within cEpi colonies (Fig. 1b, c), indicating 
activation of the distal enhanceosome of the Oct4-ΔPE-GFP 
reporter®. Subsequent culture of GFP-positive cells was carried out 
after disruption of cEpi colonies by treatment with trypsin, which is 
detrimental for cEpi cells but promotes propagation of ES-cell-like 
cells. With further passaging, we established ES-cell-like cells with 
uniform GFP expression (Fig. 1b). We call these cells reprogrammed 
epiblast ES-cell-like cells (rESCs). 

The frequency of rESC derivation was relatively high at around 22–36%, 
which notably did not diminish with developmental age from 
E5.5 to E7.5 (Supplementary Table 1). Furthermore, the epiblast cells 
at the start were uniformly negative for Oct4-ΔPE-GFP expression 
and positive for N-cadherin and inactive X-chromosome (see 
below). If reversion had occurred from rare epiblast cells, we might 
have seen a reduction in the frequency of rESC derivation. Notably, 
no ES-cell-like cells have previously been reported from embryos as 
late as E7.5; rather, they have only been derived from primitive ecto- 
derm present in the inner cell mass in blastocysts””’. Furthermore, 
pluripotent EPL’° and FAB-SC"' were derived from pre-implantation 
or implanting blastocysts and not from post-implantation embryos 
(Supplementary Table 2); their epigenetic state was not reported. 

To gain an insight into the reversion process, we examined changes 
in gene expression (Fig. 2a, b). Transcriptome analysis revealed that 
cEpi cells, like EpiSCs**, were closely related to the epiblast. Thus, 
cEpi cells showed strong expression of Eomes, Fgf5, Sox17, Gata6, 
Lefty1 and Cer1. However, there was little expression of stella (also 
called Dppa3), Pecam1, Rex1 (also called Zfp42) and Fbxo15, but their 
expression increased in rESCs with a concomitant loss of Fgf5, Eomes 
and Sox17 (Fig. 2a). Expression of key pluripotency genes Oct4, Sox2 
and Nanog also increased in rESCs (Fig. 2a and Supplementary Fig. 1, 
2b). Expression of Eomes and Fgf5 was slightly higher in early 
passage rESCs (passage 4 (P4)) compared to the levels in P24 cells 


¹Wellcome Trust/Cancer Research UK Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK. ²College of Life Science, Inner Mongolia University/ 
Mengniu RB CO. Ltd., West No. 1 Daxue Road, Huhhot, Inner Mongolia 010021, China. ³Molecular Cell Biology, Applied Biosystems, 850 Lincoln Centre Drive, Foster City, California 
94404, USA. †Present address: Department of Anatomy and Cell Biology, Graduate School of Medicine, Kyoto University, Yoshida-Konoe-Cho, Sakyo-Ku, Kyoto 606-8501, Japan. 


*These authors contributed equally to this work. 




[Figure 1 image. Panel labels: E6.5 embryo; Epiblast; Single-cell suspension; Day 6 cEpi colony; Day 21 in culture; Day 22 in culture; Oct4-ΔPE-GFP; Epiblast; cEpi; Self-renew.] 

(Fig. 2a), suggesting a loss of residual ‘memory’ of their epiblast 
origin (see below). Thus, comprehensive microarray analysis con- 
firmed that rESCs are similar to ES cells and differ from cEpi cells and 
EpiSCs (Fig. 2b and Supplementary Fig. 3). These changes in the 
transcriptome are consistent with the distal enhancer-driven activation 
of the Oct4-ΔPE-GFP reporter in rESCs. 

We next asked if LIF-STAT3 signalling is critical for reprogram- 
ming, and found that STAT3 is phosphorylated in cEpi cells, suggest- 
ing that they respond to LIF signalling (Supplementary Fig. 4a). 
Notably, addition of the Janus protein tyrosine kinase inhibitor 
(JAK inhibitor 1; Calbiochem) that prevents phosphorylation of 
tyrosine 705 of STAT3 initially allowed some cEpi colonies to develop, 
but they gradually differentiated and failed to form rESCs (Sup- 
plementary Table 3). Thus, LIF-STAT3 seems to be crucial for the 
propagation and reprogramming of cEpi cells to rESCs. Furthermore, 
culture of rESCs with the JAK inhibitor caused a marked reversal 
towards the cEpi-like transcriptome (Supplementary Fig. 4b). A num- 
ber of STAT3 targets and their expression have been identified in ES 
cells’, including Fbxo15, Rex1 and Stat3 itself, as well as the epigenetic 
modifiers Lin28, Ezh2 and Mbd3, suggesting that STAT3 has the poten- 
tial to influence the transcriptional and epigenetic state of cEpi cells. 

Next, we examined epigenetic changes in the epiblast during rever- 
sion to rESCs. Notably, reactivation of the late-replicating inactive 
X chromosome’””* during reversion to rESCs would indicate a major 
epigenetic change'*. Consistently, we found that all the E6.5 epiblast 
cells (96 of 96 cells; Supplementary Fig. 5a) had the characteristic 
accumulation of histone H3 lysine 27 trimethylation (H3K27me3) 
associated with the inactive X chromosome (ref. 15). After 12 days, 
nearly all cEpi cells (99 of 100 cells; Fig. 3a) still had the H3K27me3 
‘spot’, which declined to 62% after 25 days (37 of 60 cells), and to 
only 9% after 35 days of culture (7 of 80 cells). This suggests continu- 
ing epigenetic reprogramming towards rESCs, which uniformly 
lacked the H3K27me3 ‘spot’. These observations on H3K27me3 sug- 
gest initiation of X reactivation, a hallmark of epigenetic reprogram- 
ming, although further studies are needed to confirm completion of 
the process. Similar changes are also seen after somatic nuclear trans- 
plantation into oocytes, in mouse iPS cells, and in ES-cell-somatic 
cell hybrids'*. 
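
The percentages quoted for the H3K27me3 ‘spot’ follow directly from the cell counts given in the text. A minimal check, using only those counts (the dictionary labels are ours):

```python
# (cells with an H3K27me3 spot, cells scored), as reported in the text.
counts = {
    "E6.5 epiblast": (96, 96),
    "cEpi, day 12": (99, 100),
    "cEpi, day 25": (37, 60),
    "cEpi, day 35": (7, 80),
}

# Percentage of scored cells that retain the spot, rounded to whole numbers.
percent = {
    label: round(100 * positive / total)
    for label, (positive, total) in counts.items()
}

for label, value in percent.items():
    print(f"{label}: {value}%")
```

The rounded values for days 25 and 35 reproduce the 62% and 9% stated above.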

We also examined DNA methylation of the promoter regions of 
stella and Rex1; both of these genes (and others like Pecam1) are 
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Figure 1| Reprogramming epiblast cells from 
mouse E6.5 embryos to generate rESCs. 

a, Derivation of cEpi cells from E6.5 epiblast. 
Epiblast tissue had the proximal region (white 
line, far left panel) and visceral endoderm 
removed; a single-cell suspension (black arrow) 
was cultured, which formed cEpi colonies. Note 
alkaline phosphatase (AP)-positive cells in cEpi 
colonies in the far right panel. b, Derivation of 
rESCs from cEpi colonies. Note the appearance of 
clusters of Oct4-ΔPE-GFP-positive cells in cEpi
colonies (black arrowheads), and corresponding 
white arrowheads for GFP in the panel below. 
Scale bars, 100 µm (a and b). c, Schematic
representation of reprogramming of epiblast
through cEpi cells, and finally rESCs in response
to LIF-STAT3 signalling.
repressed in the epiblast but active in primitive ectoderm and ES
cells*’. Although stella and Rex1 were unmethylated in the epiblast,
they became transiently methylated in cEpi cells before undergoing 
demethylation (Fig. 3b), which is consistent with the activation of the 
Stella—GFP reporter in rESCs (Supplementary Fig. 2a). Once estab- 
lished, the rESC epigenotype was stable, heritable and distinct from 
EpiSCs, which retain key properties of epiblast cells, including inac- 
tive X, and in which stella and Rex1 are methylated and repressed 
(Figs 2a and 3b). These findings are relevant to human ES cells, which 
resemble mouse EpiSCs but not mouse ES cells or rESCs. 

To observe the dynamic nature of reprogramming to rESCs, we also 
examined changes in the expression of E-cadherin and N-cadherin 
(Fig. 4a). Whereas expression of both E-cadherin and N-cadherin was 
detected in E6.5 epiblast uniformly, trypsin-digestion of these cells 
before culture led to the loss of these adhesion molecules. During 
subsequent culture, we detected heterogeneous N-cadherin expres- 
sion in cEpi cells, but in continuing culture there was a complete loss 
of N-cadherin, which was replaced by uniform expression of 
E-cadherin in rESCs, consistent with the evidence that LIF-STAT3 
promotes upregulation of E-cadherin". 

Next, we asked if rESCs might have originated from early PGCs 
because they can undergo dedifferentiation into pluripotent embryonic 
germ cells that are similar to ES cells. To reduce this likelihood, the 
epiblast tissue from E6.5—E7.5 mouse embryos was dissected away from 
the most proximal region, the site of PGC precursors and PGC, respec- 
tively. In particular, E7.5 epiblast also shows a loss of competence to form 
additional PGCs*". Second, PGCs require bFGF and stem cell factor for 
proliferation and dedifferentiation into pluripotent embryonic germ 
cells, and cannot survive in culture conditions used for cEpi cells. 
Furthermore, rESCs, unlike embryonic germ cells, retain methylation 
of imprinted genes”? (Supplementary Fig. 5c). Thus, cumulative 
evidence makes it unlikely that rESCs could originate from PGCs. 

In view of our observations, we asked if established EpiSCs cultured 
in activin/bFGF could also undergo reversion to rESCs in response to 
LIF-STAT3. We chose two EpiSC lines: one with the Oct4-ΔPE-GFP
reporter (passage 20) and the other with an X-GFP reporter; the 
latter was FACS sorted to establish lines with repressed reporter on 
the inactive X chromosome (passage 23) (Supplementary Fig. 5b). 
When these EpiSCs with stably repressed GFP reporters were cultured 
for 10-20 days (Supplementary Fig. 6a, b), we detected GFP-positive 
Figure 2 | Changes in gene expression profile. a, Reverse transcription real-
time PCR of marker genes in EpiSCs, cEpi cells and rESCs at early (P4) and 
late (P24) passages, and ES cells. Note progressive loss of markers of epiblast 
detected in cEpi cells and EpiSCs (at the top) and enhancement of expression 
of genes in rESCs that resemble those expressed in ES cells. b, Whole-genome 
cluster analysis of transcriptomes of cEpi cells, rESCs at early (P4) and late 
(P24) passages, and ES cells. The numbers correspond to Pearson 
correlation coefficients between different cDNA samples. Note that rESCs
resemble ES cells but not cEpi cells, which are more like the original epiblast 
cells as described above. 


cells, from which we established rESCs as before. Re-expression of the 
X-GFP reporter indicates X reactivation. Furthermore, we found that 
the stably repressed stella and Rex1 became de-repressed after DNA 
demethylation during reversion to rESCs (Figs 2a and 3b). Thus, 
unlike previous studies”, we found that reversion of EpiSCs is 
possible in response to LIF-STAT3 without exogenous transcription 
factors. We had previously shown that ES cells in LIF-FCS exist in a
metastable state and fluctuate between ES cell and epiblast-like states 
but without proceeding completely to the EpiSC-like state. Because 
the presence of feeder cells apparently helps to promote a shift towards 
an ES-cell-like state*®, it is possible that they help to promote the 
reprogramming process. 

Finally, we tested the developmental potential of rESCs in chimaeric
embryos. Using rESCs derived from epiblast with the ROSA-LacZ 
reporter, we found an extensive contribution to developing embryos 
and in adults with germline transmission (Supplementary Fig. 7 and 
Supplementary Tables 4-6). Although early passage rESCs contri- 
buted to the extra-embryonic ectoderm (a trophectoderm derivative) 
in E6.5 embryos (Supplementary Fig. 7b), this was not seen with late- 
passage rESCs (Supplementary Table 6 and Supplementary Fig. 7c). 
Thus, a transient memory of the epiblast origin in rESCs is lost with 
progressive changes in the transcriptome and the epigenetic state. 
Most notably, rESCs from EpiSCs could also participate in chimaeras 
and contribute to the germ line in E13.5 embryos (Supplementary 
Fig. 7i and Supplementary Table 5), which is not possible with 
Figure 3 | Epigenetic changes during reprogramming of epiblast cells.

a, Female cEpi cells exhibit uniform accumulation of H3K27me3 associated 
with the inactive X (white arrowhead), which is gradually lost from 
individual cells during culture and finally lost in rESCs. Scale bar, 20 µm.
DAPI, 4',6-diamidino-2-phenylindole. b, Changes in DNA methylation of
stella and Rex1 during reprogramming of epiblast. Although stella and Rex1 
are repressed in epiblast cells, these loci are initially unmethylated; they 
undergo DNA methylation transiently in cEpi cells and stably in EpiSCs. 
Reprogramming to form rESCs results in loss of DNA methylation. 


EpiSCs**, indicating that rESCs derived from EpiSCs undergo a stable 
reversion to an ES-cell-like state. 

Post-implantation epiblast cells and established EpiSCs can ‘over- 
come’ a robust epigenetic barrier and undergo reversion to rESCs in 
response to LIF-STAT3 (Fig. 4b). This observation is significant for 
human ES cells, which resemble mouse EpiSCs**”*. Further studies 
may provide critical insights into signal-induced epigenetic repro- 
gramming, including DNA demethylation and X-reactivation, which 
also represent a major barrier during reprogramming of somatic cells 
to iPS cells’*°?”*8. Notably, specification of PGCs from epiblast is 
also accompanied by similar epigenetic reprogramming events by 
a different mechanism involving the expression of key germ-cell 
determinants, including Blimp1/Prdm1 and Prdm14 (refs 2, 21); 
these genes repress the somatic program and initiate epigenetic 
reprogramming during specification of PGCs from epiblast cells”. 
As a result, PGCs are epigenetically similar to primitive ectoderm 
cells (Fig. 4b, refs 29-31). Epigenetic changes in PGCs, and during 
the experimental reversion of epiblast and EpiSCs to rESCs, may 
provide novel insights into diverse mechanisms underlying epige- 
netic reprogramming of epiblast cells, and contribute generally to 
the understanding of the role of epigenetic mechanisms in other 
contexts. 
Figure 4 | Dynamic changes of cell surface markers and model of
reprogramming. a, Dynamic changes in cell surface adhesion molecules. Both
E-cadherin and N-cadherin are detected uniformly in E6.5 epiblast but are
undetectable in single-cell suspension. During culture, N-cadherin expression
is heterogeneous in cEpi cells and eventually disappears completely and is
replaced by E-cadherin in rESCs. Scale bars, 20 µm. b, Schematic
representation of reprogramming of ‘epiblast’ cEpi cells and EpiSCs to rESCs. 
Note the epigenetic and transcriptional changes during reprogramming of cEpi 
cells and EpiSCs. Specification of primordial germ cells (PGC) also results in 
epigenetic reprogramming, including expression of stella and X reactivation. 
ICM, inner cell mass; Xa, active X chromosome; Xi, inactive X chromosome. 


METHODS SUMMARY 


Post-implantation epiblast from mouse E5.5—E7.5 embryos was isolated by cut- 
ting out the extra-embryonic and proximal epiblast cells with glass needles. The 
epiblast was then treated with EGTA and trypsin, and dissociated into single cells 
by pipetting with a hand-pulled glass capillary. The single-cell suspension from 
individual epiblast tissue was cultured in standard ES cell medium with LIF and 
FCS on feeder cells. These cells formed colonies called cultured epiblast cells 
(cEpi), and were regularly passaged on feeder cells at 3-6-day intervals. After
culture of cEpi cells for 14-35 days, about 10-50 GFP-positive cells appeared in
individual cEpi colonies. When these GFP-positive cell clusters grew to 100-
200 µm diameter, they were treated with trypsin, and the resulting cells were
cultured to produce GFP-positive colonies. We call these cells rESCs. 
A regulatory circuit for piwi by the large Maf gene traffic jam in Drosophila


Kuniaki Saito, Sachi Inagaki, Toutai Mituyama, Yoshinori Kawamura, Yukiteru Ono, Eri Sakota,
Hazuki Kotani, Kiyoshi Asai, Haruhiko Siomi & Mikiko C. Siomi


PIWI-interacting RNAs (piRNAs) silence retrotransposons in
Drosophila germ lines by associating with the PIWI proteins 
Argonaute 3 (AGO3), Aubergine (Aub) and Piwi'”. piRNAs in 
Drosophila are produced from intergenic repetitive genes and 
piRNA clusters by two systems: the primary processing pathway 
and the amplification loop’’. The amplification loop occurs in a 
Dicer-independent, PIWI-Slicer-dependent manner***. However, 
primary piRNA processing remains elusive. Here we analysed 
piRNA processing in a Drosophila ovarian somatic cell line where 
Piwi, but not Aub or AGO3, is expressed; thus, only the primary 
piRNAs exist. In addition to flamenco, a Piwi-specific piRNA cluster’, 
traffic jam (tj)’, a large Maf gene, was determined as a new piRNA 
cluster. piRNAs arising from tj correspond to the untranslated 
regions of tj messenger RNA and are sense-oriented. piRNA loading 
on to Piwi may occur in the cytoplasm. zucchini", a gene encoding a 
putative cytoplasmic nuclease, is required for tj-derived piRNA 
production. In tj and piwi mutant ovaries, somatic cells fail to inter- 
mingle with germ cells and Fasciclin III is overexpressed. Loss of tj 
abolishes Piwi expression in gonadal somatic cells. Thus, in gonadal 
somatic cells, tj gives rise simultaneously to two different molecules: 
the TJ protein, which activates Piwi expression, and piRNAs, which 
define the Piwi targets for silencing. 

Genetic studies have shown that piwi and aub are essential in germ- 
line stem-cell self-renewal and pole-cell formation, respectively'’”’. 
Mutations introduced into piwi and aub cause de-repression of retro- 
transposons and a loss of piRNA accumulation in ovaries*. A recent 
study has revealed that strong loss-of-function mutations in AGO3 also 
increase expression of selfish genetic elements in germ lines”. Thus, the 
PIWI proteins, with their associated piRNAs, function in retrotran- 
sposon silencing. piRNA production in Drosophila ovaries occurs in a 
Dicer-independent manner®. A model for piRNA biogenesis—the 
piRNA amplification loop**—was proposed as a result of deep- 
sequencing and bioinformatic analyses of Drosophila piRNAs. In this 
model, Aub/Piwi and AGO3 reciprocally guide the 5’ end formation of 
piRNAs. 

Classification of piRNAs according to their origins indicated that 
piRNAs derived from a particular piRNA cluster locus—flamenco 
(flam)—on the X chromosome are exclusively loaded on to Piwi'”, 
indicating that those piRNAs are produced by a pathway independent 
of the amplification loop'”. This pathway is called the primary pro- 
cessing pathway’. Recently, two independent groups deduced the 
existence of the primary piRNA processing pathway from extensive 
bioinformatic analyses of piRNAs in a broad range of piRNA mutants; 
these studies reconfirmed that primary piRNAs derived from flam are 
most likely loaded directly on to Piwi and not further amplified>®. 


However, a molecular mechanism of the primary processing pathway 
remains elusive. 

We established a stable cell line of ovarian somatic cells (OSCs) 
from the parental cell line fGS/OSS, comprising germline stem cells 
and sheets of somatic cells (OSS)!*. fGS/OSS culture was shown to be
Vasa-positive’*, whereas our OSCs are Vasa-negative (Supplementary
Fig. 1a). OSCs express Fasciclin III (FasIII; Supplementary Fig. 1b) and
undergo rounds of passage in culture for several months. All these data 
support the idea that OSCs contain only mitotically active early follicle 
(somatic) cells. 

Piwi is expressed in somatic gonadal cells, whereas Aub and
AGO3 are not expressed in this cell type**’®. Western blot analysis 
revealed that Piwi, but not Aub and AGO3, was detectable in OSCs 
(Supplementary Fig. 1a). As in ovaries, Piwi in OSCs was localized in 
the nucleus (Supplementary Fig. 1c). The absence of AGO3 expres- 
sion in OSCs was further confirmed by polymerase chain reaction 
with reverse transcription (RT-PCR; Supplementary Fig. 1d, e). 

Piwi in OSCs was detected in a form bound to small RNAs of 24-30 
nucleotides (Fig. 1a). The size distribution of these small RNAs was 
very similar to that of gonadal Piwi-associating piRNAs (Piwi 
piRNAs)**"*. In addition, they showed resistance to periodate oxida- 
tion and f-elimination treatments, which are hallmarks of 2’-O- 
methyl modification at the 3’ end, as in ovarian piRNAs*'”'* (Sup- 
plementary Fig. 1f). Thus, the small RNAs associated with Piwi in 
OSCs could be categorized as piRNAs and, importantly, these 
piRNAs are produced in an Aub/AGO3-independent manner. 

We examined whether the nuclear localization of Piwi is required 
for piRNA production and for Piwi loading in OSCs. A mutant of 
Piwi (Piwi-ΔN) in which its putative nuclear localization signals were
deleted, resulting in cytoplasmic localization of the mutant (Fig. 1b), 
was loaded with piRNAs in a similar manner to wild-type Piwi 
(Fig. 1c). We also observed that a Slicer-deficient Piwi mutant— 
Piwi-DDAA, where two aspartic acids (D614 and D685) in the 
PIWI domain, which are required for Slicer activity, are altered to 
alanines—was loaded with piRNAs, similarly to wild-type Piwi 
(Fig. 1c). Depletion of endogenous Piwi from OSCs did not affect 
piRNA loading on to a double mutant of Piwi (Piwi-ΔN-DDAA)
(Supplementary Fig. 1g). Piwi does not seem to homodimerize in 
vivo (data not shown). These results support a model in which the 
primary piRNA processing and the piRNA loading on Piwi may 
occur in the cytoplasm in a Piwi-Slicer-independent manner. 

Piwi-associating piRNAs in OSCs (OSC piRNAs) are mainly 
derived from the antisense strand of retrotransposons (Fig. 2a and 
Supplementary Fig. 2a), similar to the derivation of ovarian Piwi 
piRNAs**'*””. The size distribution of OSC piRNAs is shown in 
Figure 1 | Piwi in OSCs is associated with endogenous small RNAs. a, Piwi-
associating small RNAs in OSCs were visualized by 32P-labelling. n.i., non-
immune IgG was used as a negative control. b, Subcellular localization of
Myc-tagged wild-type Piwi, Piwi-ΔN and Piwi-DDAA in OSCs. Scale bars,
20 µm. DAPI, 4',6-diamidino-2-phenylindole. c, Myc-tagged wild-type Piwi,
Piwi-ΔN and Piwi-DDAA expressed in OSCs are bound with piRNAs.

M, molecular mass marker. 


Deleted peptide sequence (1-72):
MADDQGRGRRRPLNEDDSSTSRGSGDGPRVKVERGSSSGDPRADPRIEASRERGALEEAPRREGGPPERKPW


Supplementary Fig. 2b. Examination of their nucleotide bias indi- 
cated that OSC piRNAs mostly have a bias for U as the first nucleotide 
in the sequence (1st-U), but no other prominent bias was observed 
throughout their entire sequence (Fig. 2b). Exclusion of piRNAs with 
1st-U from the piRNA pool did not uncover any obvious bias,
including 10th-A (Supplementary Fig. 2c). piRNA pairings through 
the ten nucleotides from the 5’ ends were negligible (Supplementary 
Fig. 2d). Thus, Piwi does not self-amplify piRNAs. All these observa- 
tions correlate well with the data obtained from a recent deep- 
sequencing study that was performed using a Drosophila ovarian 
somatic sheet (OSS)’. 
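
The two checks described in this paragraph, first-nucleotide bias and pairing through the first ten nucleotides from the 5’ ends, can be sketched in a few lines of Python. This is an illustrative sketch only: the function names are hypothetical, the reads are invented toy sequences rather than data from the study, and exact string matching stands in for whatever similarity search was actually used.

```python
def reverse_complement(rna):
    """Reverse-complement an RNA string (A/C/G/U only)."""
    pairs = {"A": "U", "U": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(rna))

def first_base_fraction(reads, base="U"):
    """Fraction of reads that start with the given base (1st-U bias)."""
    return sum(read[0] == base for read in reads) / len(reads)

def reads_with_ten_base_partner(reads, overlap=10):
    """Count reads whose first `overlap` bases are the reverse complement
    of the first `overlap` bases of some read in the pool.
    Caveat: a self-complementary prefix would count as its own partner."""
    prefixes = [read[:overlap] for read in reads if len(read) >= overlap]
    prefix_set = set(prefixes)
    return sum(reverse_complement(p) in prefix_set for p in prefixes)

# Toy sequences, invented for illustration only.
toy_reads = [
    "UGAGGUAGUAGGUUGUAUAGUUAAC",
    "UACUACCUCAACGGAUCCGAUUGCA",
    "UCCGGAUUCGAUCGGAUACGAUCGA",
]
print(first_base_fraction(toy_reads))
print(reads_with_ten_base_partner(toy_reads))
```

In the toy pool the first two reads pair through their first ten bases, so the partner read carries an A at position 10. In the OSC data such pairings were reported as negligible, which is the basis for concluding that Piwi does not self-amplify piRNAs.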

The origins of OSC piRNAs were examined (Fig. 2c and 
Supplementary Fig. 3). Unique mapping (see Methods for definition) 
of OSC piRNAs on the Drosophila genome revealed that flam is the 
main source (1,365 perfectly matched and 186 one-base mismatched 
piRNAs; Supplementary Figs 3 and 4), as it is for ovarian Piwi 
piRNAs? and OSS piRNAs’. In addition to flam, we found another 
locus on chromosome 2L that also produces piRNAs uniquely 
mapped to the particular region (322 perfectly matched and 29 
one-base mismatched piRNAs; Fig. 2c; chromosome 2L, unique). 
This locus corresponds to the 3’ untranslated region (UTR) of a 
protein-coding, single-exon gene, traffic jam’ (tj, Fig. 2d). 

TJ is a soma-specific large Maf factor necessary for controlling 
gonad morphogenesis in Drosophila’. TJ is the only Drosophila orthologue
of the transcription factors c-Maf and MafB/Kreisler in vertebrates’.
In tj mutant gonads, somatic cells fail to intermingle and
properly develop germ cells. This eventually causes an early blockage 
in germ-cell differentiation and no follicle cells are detected in adult 
ovaries of tj mutants’. All of the OSC piRNAs derived from tj were 
sense-oriented (Fig. 2d), indicating that the tj transcript may serve 
not only as the template for TJ synthesis but also as the precursor of 
the piRNAs. The tj transcript in OSCs was not dedicated to piRNA
production, because the TJ protein was strongly expressed in OSCs 
(Supplementary Fig. 5a). The existence of piRNAs derived from the 
3’ UTR of tj was further confirmed by northern blot analysis (Sup- 
plementary Fig. 5b). tj-derived piRNAs also appeared in ovarian total 
Figure 2 | Piwi-associated piRNAs in OSCs. a, The contents of Piwi-
associated small RNAs in OSCs perfectly match the Drosophila genome
sequence. b, Examination of nucleotide bias indicates that Piwi piRNAs have
mostly uracil at the 5’ ends (1st-U). c, Mapping data on chromosome 2L.
Unique mapping data (bottom; shown as ‘unique’) indicate that tj is the
source of OSC piRNAs. y axis shows the read number. d, The locus
containing tj (shown in c) is enlarged. Green bars (bottom) indicate piRNAs
corresponding to tj. Two thick black lines show probes (ORF and 3’ UTR) in
e. e, Northern blot analysis. Both ORF and 3’ UTR probes visualized a single
band of the same length corresponding to the tj transcript in OSCs.


small RNA libraries produced by ref. 20 (Supplementary Fig. 6) and 
ref. 6 (see later). 

We assessed whether the transcriptional unit of tj is first divided 
into two parts, each with an individual function—one for TJ syn- 
thesis and the other for piRNA production—or if one full-length tj 
transcriptional unit contains both functions. Northern blot analysis, 
using two probes corresponding to either the open reading frame
(ORF) or the 3’ UTR of tj, visualized a single discrete band of the 
same length (Fig. 2e), indicating that the latter scenario might be the 
case. 

piRNAs derived from 3’ UTRs (and from slightly extended 
regions) of protein-coding genes other than tj were also found 
(Supplementary Table 1). For example, the 3’ UTRs of brat’! and 
Klp10A” also produce piRNAs (Supplementary Fig. 7). Interestingly, 
they are all derived from the sense strand; thus the parental genes are 
apparently not the targets for gene silencing by Piwi. The parental 
genes are also not repetitive. By contrast, piRNAs derived from retro- 
transposons or flam in OSCs are mainly antisense-oriented and are 
thought to arise from much longer, repetitive precursors’. 

Mutations in zucchini (zuc), a gene encoding a putative nuclease, 
cause female sterility’? and a reduction of roo- and flam-derived 
piRNAs in ovaries*'®. However, whether or not zuc is expressed in 
gonadal somatic cells and, if it is expressed, where Zuc accumulates 
was unknown. We observed that OSCs express zuc (data not shown) 
Figure 3 | Involvement of zuc in the tj-piRNA production pathway. a, Myc-
tagged Zuc protein was overexpressed in OSCs and its subcellular 
localization was determined using an anti-Myc antibody (red). Zuc 
accumulates in the perinuclear region in the cytoplasm of OSCs. DIC, 
differential interference contrast. b, Accumulation of tj-derived piRNAs, but 
not of miRNAs, requires zuc in OSCs. c, The read numbers of tj-derived
piRNAs in various piRNA mutants. These data, obtained from the data set of 
ref. 6, strongly support the idea that zuc is required for tj-piRNA production. 




Figure 4 | Phenotypes of tj and piwi mutant ovaries and testes. a, In
control (piwi2/CyO) ovaries, TJ-positive cells and primordial germ cells
(PGCs) form a mixed cell population. In piwi2/piwi2 larval ovaries, TJ-
positive cells do not intermingle with PGCs, but instead form coherent
clusters. TJ and Vasa are shown in green and magenta, respectively. Scale
bars, 20 µm. b, In tj mutant (tjeo2/Df(2L)E55) larval ovaries, Piwi (magenta)
and that Zuc is predominantly localized in the cytoplasm of OSCs,
particularly in the perinuclear region (Fig. 3a). We then sought to 
determine whether zuc is necessary for tj piRNA production in OSCs. 
Depletion of zuc by RNA interference (RNAi) significantly reduced 
the expression level of tj piRNAs, but not of microRNAs (Fig. 3b), 
suggesting that zuc is involved in the tj piRNA production pathway. 
This was further supported by analysis of the data set of ref. 6, which 
indicated that the read number of tj piRNAs in zuc mutants was much 
lower compared with those in other mutants (Fig. 3c). 

It was previously reported that tj mutant somatic cells show a 
failure to intermingle with germ cells in third instar larval ovaries’. 
Notably, the piwi mutants” piwi1 and piwi2 showed a
similar phenotype: somatic cells in the ovaries of third instar larvae of 
piwi mutants adhere to each other and exclude Vasa-positive germ 
cells (Fig. 4a and Supplementary Fig. 8a). aub mutants did not phe- 
nocopy the piwi mutants (Supplementary Fig. 8b). piwi mutants 
express the tj transcript (data not shown) and the TJ protein
(Fig. 4a and Supplementary Fig. 8a) at approximately wild-type 
levels. Thus it is unlikely that tj piRNAs target the parental tj gene. 

We next examined the expression of Piwi in larval ovaries of tj 
mutants. Vasa and TJ are known to be expressed in germline stem cells 
(and in their developing cells) and somatic cells, respectively, whereas 
Piwi is known to be expressed in both cell types*''"*. In wild-type larval 
ovaries, Vasa- and TJ-positive cells are mutually exclusive, but Piwi is 
expressed in both cell populations (Fig. 4b and Supplementary Fig. 9a). 
By contrast, in tj mutants, Piwi expression was restricted to Vasa- 
positive cells (Fig. 4b and Supplementary Fig. 9a). These results indi- 
cate that TJ is the activator of piwi expression in gonadal somatic cells. 

Expression of Piwi in tj mutant testes was also examined. As in 
ovaries, tj mutations in testes caused a lack of Piwi expression in 
gonadal somatic cells, including hub, cyst progenitor cells and early 
cyst cells. In contrast, Piwi signals were clearly seen in these cell types 
of wild-type testes (Supplementary Fig. 9b). Interestingly, without 
functional TJ, germline stem cells and their developing cells in testes, 
which normally show a faint signal for Piwi, highly express Piwi 
(Supplementary Fig. 9b). It seems that TJ in testes negatively controls 
is expressed only in Vasa-positive (blue) cells. TJ is shown in green. Scale
bars, 20 µm. c, Quantitative RT-PCR shows that the expression level of
FasIII is upregulated by loss of piwi or tj expression. d, Two functions of tj in
the regulation of the function of piwi in gonadal somatic cells. The first 
function of tj is to activate piwi expression. The second is to supply piRNAs 
for Piwi. Genes targeted by the complex might include FasIII. 
piwi expression and indirectly controls expression in germline stem
cells and their developing cells. 

This study has uncovered two functions of tj in the regulation of 
piwi's functions. In gonadal somatic cells, TJ supposedly controls
transcription of various genes. Our study indicates that piwi is highly 
likely to be one of the genes under strong TJ control because loss of tj 
in gonadal somatic cells abolished Piwi expression. Further support 
for the hypothesis that TJ controls piwi expression was provided by 
DNA sequences near the putative transcriptional start site of the piwi 
gene, which show a weak but significant similarity to the Maf binding 
consensus sequence™, and which were bound with TJ in OSCs 
(Supplementary Fig. 10a, b). Thus, the first function of tj is to activate
the expression of Piwi in gonadal somatic cells. The second function 
is to supply piRNAs for Piwi. Without the supplement of tj piRNAs, 
Piwi would lose the activity to target genes that should be silenced by 
Piwi and the tj-piRNA complex. A likely target of such silencing is
FasIII because FasIII, a cell adhesion molecule concentrated at the
hub cell junction, is ectopically overexpressed in other somatic cells
in tj larval testes’. Indeed, the FasIII expression level was higher in
piwi mutant testes than in control testes (Fig. 4c). Some of the 
piRNAs identified in this study showed strong complementarity to 
the FasIII primary transcript (Supplementary Fig. 10c). Although all 
the tj-derived piRNAs are sense-oriented and thus unlikely to target 
the tj mRNA, given the nuclear localization of Piwi, it is conceivable 
that the Piwi-piRNA complex could associate with the tj gene. 

These findings suggest a novel regulatory circuit where tj mRNA 
simultaneously produces two types of functional molecules (Fig. 4d): 
TJ protein, which activates expression of Piwi, and piRNAs, which 
are loaded on to Piwi to silence specific target genes, such as FasIII 
and other, as yet undiscovered, genes. 


METHODS SUMMARY 


The OSC line was developed from fGS/OSS"’. Piwi was immunopurified from 
OSCs using an anti-Piwi antibody". Cloning of small RNAs associated with Piwi 
in OSCs was carried out as described”. Genome mapping and annotation was 
performed as described in the Methods. Western blotting’, RT-PCR, periodate oxidation/
β-elimination*”’, northern blotting'* and immunostaining” were performed
as described. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

OSC culture. Culture medium was prepared essentially as previously 
described’*. Cross and Sang’s M3 (BF) medium was prepared from Shields 
and Sang M3 Insect Medium supplemented with 0.6 mg ml−1 glutathione,
10% FBS, 10 mU ml−1 insulin and 10% fly extract. We collected adult Oregon
R flies and measured their total weight. Flies were treated with 70% ethanol for 
5 min, and then washed several times with PBS. Flies were transferred to a mortar 
and 10 ml of M3 medium supplemented with 10% FBS (M3/FBS) was added. 
Flies were homogenized using a pestle on ice, after which M3/FBS was added to 
the homogenate, giving a final concentration of 0.2 g (fly weight) per ml. This 
homogenate was centrifuged at 6,000g for 10 min. The supernatant was divided 
into 500 µl aliquots and then heat treated at 60 °C for 5 min. This preparation was
spun at 17,000g for 10 min. The supernatant was centrifuged at least twice more 
and the clear supernatant then collected. The supernatant (fly extract) was stored 
at −30 °C before use. Fly extract was sterilized by filtration and added to the
culture medium to a final concentration of 10%. For passaging, OSCs were 
washed in PBS and then exposed to a solution of trypsin-EDTA for 1 min at 
37°C. Culture medium was then added to neutralize the trypsin. The cells were 
washed by aspiration in the medium and aliquots were pipetted into a 6-well 
plate (Falcon) or a 90-mm cell culturing dish (FP). These were then incubated at 
26 °C (the detailed protocol is available on request). 

Drosophila strains. yellow white (y w) and Oregon R were used as wild-type 
strains. The piwi alleles used were piwi1 and piwi2 (Bloomington stock number
12225) (gift from H. Lin). Df(2L)BSC145 (Bloomington stock number 9505) is a 
deficiency line that covers the piwi locus. The tj alleles used were tj’ and tj”
(gift from D. Godt, S. Kobayashi and H. Sano). Df(2L)E55 (DGRC stock number
106822) is a deficiency line that covers the tj locus. The aub alleles used were 
aubHN2 cn bw/CyO and aubQC42 cn bw/CyO. aubHN2 cn bw/CyO and aubQC42 cn
bw/CyO were crossed to yield aub heterozygous flies, aubHN2/aubQC42. To recognize
piwi', gl and aubHN2/aubQC42 larvae, w*; L2 Pin1/CyO, P{GAL4-
Kr.C}DC3 P{UAS-GFP.S65T}DC7 (Bloomington stock number 5194) were used 
as balancer chromosomes. All stocks were maintained at 25 °C. 
Immunoprecipitation and cloning of small RNAs associated with Piwi in 
OSCs. Piwi was immunopurified from OSCs using a specific antibody". 
1 × 10⁸ OSCs were homogenized in a hypotonic buffer (30 mM HEPES,
pH 7.3, 2 mM magnesium acetate, 5 mM dithiothreitol (DTT) and 1 mg ml−1
Pefablock SC) to prepare OSC lysate. Piwi and AGO1 were immunopurified 
using anti-AGO1 (ref. 27) and anti-Piwi (P3G11) antibodies’ immobilized on 
GammaBind beads (GE Healthcare). Just before immunoprecipitation, 
potassium acetate and NP-40 were added to the lysates to 150 mM and 0.1%, 
respectively. Reaction mixtures were rocked at 4 °C for 2h and beads were 
washed five times with washing buffer (30 mM HEPES, pH 7.3, 150 mM
potassium acetate, 2 mM magnesium acetate, 5 mM DTT, 0.1% NP-40 and
1 mg ml−1 Pefablock SC). After immunoprecipitation, total RNAs were isolated
from the immunoprecipitates with phenol-chloroform and precipitated with 
ethanol. RNAs were dephosphorylated with CIP (NEB) and labelled for visualization
with 32P-γ-ATP using T4 polynucleotide kinase (Takara). Cloning of
small RNAs associated with Piwi in OSCs was carried out as previously 
described’®. Deep-sequencing was performed on a GS FLX system (Roche). 
Processing sequence tags. The adaptor sequences attached to the 5’ and 3’ ends 
of every sequence produced from the Roche/454 FLX system were removed. 
Ideally, the raw sequence belonged to one of the following canonical patterns: 
5'-ATCGTCTCGGGATGAAA(N...)TTTCATCCCGAGACGAT-3’ and
3’-ATTGACCCGAGTTACAG(N...)TTTCATCCCGAGACGAT-5’, where (N...) represents
an arbitrary nucleotide sequence for subsequent analyses. The adaptors can
have deletions at their 5'/3’ ends. We used the NCBI bl2seq program (-p 
BLASTN -g F -W 10 -e 10 -D 1) to detect fragments of the adaptor sequences 
in the raw sequence and extracted 21,211 sequences belonging to the canonical/ 
near-canonical patterns. Sequences containing ambiguous Ns were discarded. 
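
The adaptor-removal step can be illustrated with a minimal Python sketch. It is not the bl2seq-based procedure described above: it substitutes a plain pattern match, uses only the first canonical pattern, and assumes that at least 10 adaptor bases next to the insert survive any end deletions. The names `extract_insert` and `MIN_ADAPTOR` are hypothetical.

```python
import re

# Adaptor sequences as printed in the first canonical pattern above.
FIVE_PRIME = "ATCGTCTCGGGATGAAA"
THREE_PRIME = "TTTCATCCCGAGACGAT"

# Assumption for illustration: at least this many adaptor bases adjacent to
# the insert must be present, so adaptors truncated at their outer ends match.
MIN_ADAPTOR = 10

_PATTERN = re.compile(
    re.escape(FIVE_PRIME[-MIN_ADAPTOR:])
    + "([ACGTN]+?)"
    + re.escape(THREE_PRIME[:MIN_ADAPTOR])
)

def extract_insert(read):
    """Return the insert between the adaptors, or None if the read is rejected."""
    match = _PATTERN.search(read.upper())
    if match is None:
        return None          # no canonical or near-canonical adaptor pair found
    insert = match.group(1)
    if "N" in insert:
        return None          # sequences containing ambiguous Ns are discarded
    return insert
```

A read flanked by full or end-truncated adaptors yields its insert; reads without a recognizable adaptor pair, or with an N in the insert, return `None`.
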
Genome mapping and annotation. We followed a previously described proced- 
ure to map and annotate sequences”. 21,089 sequences were mapped to the 
Drosophila melanogaster (dm3) genome. The annotation results are shown as a 
pie chart in Fig. 2a. Supplementary Fig. 2a compares the characteristics of small 
RNAs associated with ovarian Piwi, Aub, AGO3* and OSC Piwi in terms of 
strand bias (left) and cloning frequency (right). The strand bias is computed 
by comparing the mapping strand of a piRNA with the strand of a coinciding 
transposon (112 popular classes are used) of the natural transposable element 
(FlyBase**) track of the UCSC GenomeBrowser for Functional RNA”. To dis- 
tinguish a piRNA solely mapped on to a single genomic locus from piRNAs 
mapped on to multiple genomic loci, we use the term ‘unique mapping’.
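
As a rough illustration of the two quantities just described, the Python sketch below computes a strand bias and selects uniquely mapped reads. The text does not give an exact formula, so expressing the bias as the antisense fraction is an assumption, as are the input layouts and the function names.

```python
from collections import Counter

def strand_bias(hits):
    """hits: iterable of (read_id, read_strand, transposon_strand) tuples.

    Returns the fraction of hits antisense to the coinciding transposon
    (an assumed definition), or None when there are no hits.
    """
    counts = Counter("sense" if r == t else "antisense" for _, r, t in hits)
    total = counts["sense"] + counts["antisense"]
    return counts["antisense"] / total if total else None

def unique_mappers(loci_by_read):
    """loci_by_read: dict mapping read_id to a list of genomic loci.

    Keeps only reads mapped to exactly one locus ('unique mapping').
    """
    return {rid for rid, loci in loci_by_read.items() if len(loci) == 1}
```
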
Sequence logo. We used the Web Logo program” to produce Fig. 2b and 
Supplementary Fig. 2c. The sequences, ranging from 25 to 27 bases, are aligned 
to the 5’ end with appended ‘-’ to fill the gaps for sequences shorter than 27 bases. 
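
The alignment step for the logo input can be sketched as follows. This is an illustrative helper, not the authors' script; the function names are hypothetical, and the per-position counts are added only to show what a logo program tallies.

```python
from collections import Counter

def pad_for_logo(seqs, min_len=25, max_len=27, pad="-"):
    """Keep sequences of 25 to 27 bases and left-align them at the 5' end,
    appending '-' so that every row is max_len characters long."""
    kept = [s for s in seqs if min_len <= len(s) <= max_len]
    return [s.ljust(max_len, pad) for s in kept]

def column_counts(aligned):
    """Count the characters seen at each alignment column."""
    return [Counter(col) for col in zip(*aligned)]
```
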
Frequency map. Frequency maps for OSC piRNAs were generated (Fig. 2c and
Supplementary Fig. 3) representing the number of piRNAs that are 100% 
identical to the genomic sequence in a 5-kb sliding window. piRNAs in the 
forward strand are rendered in green whereas piRNAs in the reverse strand are 
rendered in red. 
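
A sliding-window count of this kind can be sketched in a few lines of Python. The 5-kb window comes from the text; the step size, the use of the piRNA start coordinate to assign a read to a window, and the function name are assumptions made for illustration.

```python
def frequency_map(hits, chrom_length, window=5000, step=1000):
    """hits: iterable of (start, strand) for perfectly matching piRNAs,
    with strand '+' (forward) or '-' (reverse).

    Returns a list of (window_start, forward_count, reverse_count).
    """
    hits = list(hits)
    rows = []
    last_start = max(chrom_length - window, 0)
    for w_start in range(0, last_start + 1, step):
        w_end = w_start + window
        forward = sum(1 for pos, s in hits if w_start <= pos < w_end and s == "+")
        reverse = sum(1 for pos, s in hits if w_start <= pos < w_end and s == "-")
        rows.append((w_start, forward, reverse))
    return rows
```

The forward and reverse counts correspond to the green and red tracks, respectively.
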

Surveying ten-base binding partners. We performed a sequence similarity 
search using the NCBI BLASTN program among small RNAs associated with 
ovarian Piwi, Aub, AGO3’ and OSC Piwi to detect potential binding partners 
that bind each other with 10-base full complementarity. We used the sequences 
that were 100% identical to the genomic sequence. 
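
The partner search can be approximated without BLASTN by indexing ten-base prefixes. The sketch below assumes that the complementary stretch is formed by the first ten bases of each small RNA, counted from the 5′ end; the text specifies only 10-base full complementarity, so that positional constraint and the function names are illustrative assumptions.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

def revcomp(seq):
    """Reverse complement of a DNA-alphabet sequence."""
    return seq.translate(_COMPLEMENT)[::-1]

def ten_base_partners(set_a, set_b, k=10):
    """Return (a, b) pairs whose 5' k-mers are fully complementary."""
    index = defaultdict(list)
    for b in set_b:
        if len(b) >= k:
            index[b[:k]].append(b)
    pairs = []
    for a in set_a:
        if len(a) >= k:
            for b in index.get(revcomp(a[:k]), []):
                pairs.append((a, b))
    return pairs
```
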

Northern blot, western blot and β-elimination analyses. Northern blot analysis
was carried out essentially as previously described". Total RNAs of OSCs and S2
cells were isolated using ISOGEN (Wako). Total RNAs from the immunopreci- 
pitates were isolated with phenol-chloroform and precipitated with ethanol. 
DNA fragments for detecting tj transcripts (accession number AY325814) were 
cloned into pBS SK+. The ORF probe corresponded to nucleotides 801-1300 
and the 3’ UTR probe corresponded to nucleotides 2143-2402. Sequences of the 
primers were as follows: tj-ORF-forward, 5’-CATGACATGATGTGGCTGAC-3’; 
tj-ORF-reverse, 5′-GATCTGGTCGAGCTGGCG-3′; tj-utr-forward,
5′-TTTTCAAGAGAAGTGCATTCCC-3′; tj-utr-reverse,
5′-TATCTCATCTATCTCAATCTCTGTC-3′. PCR products were used as templates. DNA probes were synthesized
using a random primed labelling kit (Takara) in the presence of ³²P-dCTP. For
small RNA northern blot analyses, probes used for miR-310, tj-piR-1, tj-piR-2, 
klp10a-piR-1 and brat-piR-1 were as follows: miR-310,
5′-AAAGGCCGGGAAGTGTGCAATA-3′; tj-piR-1, 5′-GGTAATGGGAATGCACTTCTCTTGAA-3′;
tj-piR-2, 5′-TCTCATCTATCTCAATCTCTGTCGA-3′; klp10a-piR-1,
5′-GATTGCAGTTCGGTTTGCGTGTGA-3′; brat-piR-1,
5′-TTGTGTCGCGGTTCGGTTTTGGGT-3′. The DNA oligonucleotides were labelled with T4 polynucleotide
kinase in the presence of ³²P-γ-ATP. Western blot analysis was performed as
described previously*. Ten micrograms of protein from each sample was loaded 
on gels. Culture supernatants of anti-Piwi hybridoma cells (P3G11)'*, a mouse 
monoclonal antibody for Aub’®, culture supernatants of anti-AGO3 hybridoma 
cells‘, a rabbit polyclonal antibody to Vasa (1:1,000 dilution), a guinea-pig 
antibody to TJ (1:1,000 dilution)’ and anti-tubulin (from DSHB) (1:5,000 dilution)
were used. Periodate oxidation/β-elimination treatment was performed as
previously described”. 

RT-PCR analyses. Total RNAs of ovaries, S2 cells and OSCs were isolated using
ISOGEN according to the manufacturer’s instructions. Total RNAs were treated 
with DNase to eliminate DNA contamination. Five-hundred nanograms of total 
RNA was annealed with an oligo-dT primer. Reverse transcription was performed 
using Accuscript High Fidelity Reverse Transcriptase (Stratagene) according to 
the manufacturer’s instructions. The resultant cDNAs were amplified using 
KOD-plus DNA polymerase and primers for each gene. Sequences of the oligonucleotide
primers used are: Dmhen1/Pimet, 5′-ATGTTTTCGCACAAGTTTATTTGCGGG
(forward), 5′-GCCCAAACACCAGTTCTTGAAC (reverse); AGO3,
5′-GCGAGACGAAGTACGGTCAGATAAC (forward), 5′-CAATCAAATAAGCCAATTTGTGAGCG
(reverse). For quantitative RT-PCR, total RNA (0.1 µg)
was used to reverse transcribe target sequences using a Transcriptor First strand 
cDNA Synthesis Kit (Roche) according to the manufacturer’s instructions. The 
resulting cDNAs were amplified with a LightCycler 480 Real-Time PCR 
Instrument II (Roche) using the LightCycler 480 SYBR Green I Master (Roche). 
The primers used are shown in Supplementary Table 2. 

Myc-tagged protein expression and RNAi in OSCs. A short DNA fragment 
encoding a Myc-tag was inserted into the KpnI site of pAc5.1/V5-HisA 
(Invitrogen) to produce pAcM. Full-length piwi and zuc cDNA were inserted 
into pAcM to yield Myc-Piwi wild type and Myc-Zuc, respectively. To produce
Myc-Piwi-ΔN, a partial fragment of the piwi gene (amino acids 73-843) was
amplified and inserted into pAcM. Mutagenesis to yield Myc-Piwi-DDAA and
Myc-Piwi-ΔN-DDAA was performed with a QuikChange PCR kit using Pfu
Turbo (Stratagene). Primers used are shown in Supplementary Table 2.
Trypsinized OSCs (3 × 10⁶ cells) were suspended in 100 µl of Solution V of
the Cell Line Nucleofector Kit V (Amaxa Biosystems) and mixed with 5 µg of
expression plasmid or 200 pmol of siRNA duplex. Transfection was conducted in 
an electroporation cuvette using the Nucleofector instrument (Amaxa 
Biosystems). The transfected cells were transferred to fresh OSC medium and 
incubated at 26 °C. In the case of RNAi, cells were transfected again after 48 h (the 
detailed protocol is available on request). The siRNA duplexes used are shown in 
Supplementary Table 2. 

Immunostaining. OSCs were fixed with 4% formaldehyde in PBS for 15 min and 
treated with 0.1% Triton X-100 in PBS for 15 min. After blocking with 2% BSA in 
PBS, they were incubated with the primary antibody for 1h. Immunostaining of 
larval ovaries was performed essentially as previously described”*. Briefly, third-instar
larval or adult ovaries were dissected manually in PBS. The samples were fixed in
4% paraformaldehyde in PBS for 20 min and treated with methanol and detergent. 
The samples were washed three times (20 min each) in PBTx (PBS containing 0.1%
Triton X-100) and blocked in PBTxb (PBTx containing 5% bovine serum (Sigma))
for 1.5 h. Subsequently, samples were incubated with primary antibodies in PBTxb
for 16 h at 4 °C. After washing three times (20 min each) in PBTx, samples were
incubated with secondary antibodies in PBTxb for 16 h at 4 °C and rinsed three
times (20 min each) in PBTx. Culture supernatants of anti-Piwi hybridoma cells 
(P4D2, 1:1 dilution)'*, a mouse monoclonal antibody to Myc-tag (1:1,000 dilution, 
Sigma), a guinea-pig antibody to TJ (1:2,000 dilution)’, a rabbit polyclonal 
antibody to Vasa (1:1,000 dilution) and a mouse antibody to FasIII (1:50 dilution) 
were used for primary antibodies. Alexa-488-conjugated anti-mouse IgG (Mole- 
cular Probes), Cy3-conjugated anti-mouse IgG (Sigma), Alexa-546-conjugated 
anti-rabbit IgG (Molecular Probes), Alexa-633-conjugated anti-rabbit IgG 
(Molecular Probes), Alexa-546-conjugated anti-guinea-pig IgG (Molecular 
Probes) and Alexa-488-conjugated anti-guinea-pig IgG (Molecular Probes) were 
used as the secondary antibodies. DNA was stained with DAPI. All images were 
collected using a confocal microscope (Zeiss LSM5 EXCITER).

ChIP assay. ChIP assays were performed using the EZ ChIP Chromatin 
Immunoprecipitation Kit (Upstate). Briefly, 2 × 10⁷ cells were crosslinked with
1% formaldehyde for 10 min at room temperature and sheared to produce
crosslinked DNA (~200-1,000 bp in length). The DNA-protein complexes were 
immunoprecipitated overnight with either non-immune IgG antibody or anti-TJ 
antibody. After reversing crosslinking at 65 °C for 12-16h, DNA was recovered. 
Short DNA fragments corresponding to piwi gene were amplified by a LightCycler 
480 Real-Time PCR Instrument II using SYBR Premix Ex Taq (Takara). The 
primers used are listed in Supplementary Table 2. 
Exploitation of binding energy for catalysis and design


Summer B. Thyme, Jordan Jarjour, Ryo Takeuchi, James J. Havranek, Justin Ashworth,
Andrew M. Scharenberg, Barry L. Stoddard & David Baker


Enzymes use substrate-binding energy both to promote ground- 
state association and to stabilize the reaction transition state selectively’.
The monomeric homing endonuclease I-AniI cleaves with
high sequence specificity in the centre of a 20-base-pair (bp) DNA 
target site, with the amino (N)-terminal domain of the enzyme 
making extensive binding interactions with the left (—) side of 
the target site and the similarly structured carboxy (C)-terminal 
domain interacting with the right (+) side’. Here we show that, 
despite the approximate twofold symmetry of the enzyme-DNA 
complex, there is almost complete segregation of interactions 
responsible for substrate binding to the (—) side of the interface 
and interactions responsible for transition-state stabilization to 
the (+) side. Although single base-pair substitutions throughout 
the entire DNA target site reduce catalytic efficiency, mutations in 
the (—) DNA half-site almost exclusively increase the dissociation 
constant (KD) and the Michaelis constant under single-turnover
conditions (KM*), and those in the (+) half-site primarily decrease
the turnover number (kcat*). The reduction of activity produced by
mutations on the (—) side, but not mutations on the (+) side, can 
be suppressed by tethering the substrate to the endonuclease 
displayed on the surface of yeast. This dramatic asymmetry in 
the use of enzyme-substrate binding energy for catalysis has direct 
relevance to the redesign of endonucleases to cleave genomic 
target sites for gene therapy and other applications. Compu- 
tationally redesigned enzymes that achieve new specificities on 
the (−) side do so by modulating KM*, whereas redesigns with
altered specificities on the (+) side modulate kcat*. Our results
illustrate how classical enzymology and modern protein design 
can each inform the other. 

Enzymes use interactions with the substrate to promote catalysis 
both by bringing the substrate into close proximity and proper align- 
ment with catalytic groups on the enzyme and by selectively stabilizing 
the transition state for the chemical reaction*>. Dissection of the 
contributions to enzyme catalysis has taken on renewed importance 
with the advent of computational and directed evolution approaches 
for engineering novel enzymatic activities for applications ranging 
from synthetic chemistry to therapeutics®*’. Reprogramming the 
specificity of the LAGLIDADG family of homing endonucleases 
for genome engineering and biotechnology purposes is one such 
application®”. 

Control experiments probing the binding specificity of the I-AniI
homing endonuclease, in preparation for computational redesign of
specificity, revealed a striking asymmetry in the effect of base substitutions
on binding affinity (Fig. 1a). DNA cleavage and DNA binding by
Y2 I-AniI endonuclease’’ were assayed for 60 different target sites,
each containing a single base-pair substitution from the wild-type 
recognition sequence. Consistent with previous observations’, 
enzyme activity assays showed that many nucleotide substitutions 


throughout the extended 20-bp recognition site abrogated or reduced 
cleavage, reflecting the high sequence specificity of the endonuclease 
(Fig. 1b). Fluorescence-binding experiments showed that for mutations
between −10 and −3 on the (−) side of the interface, this loss of
cleavage activity is associated with a loss of binding affinity. In sharp
contrast, mutations in the −2 to +10 region of the recognition site,
which also eliminated or reduced cleavage, had a minimal effect on
substrate binding (Fig. 1c).

To determine whether the differences between the (—) and (+) 
side substitutions reflected differential contributions to ground-state 
association versus transition-state stabilization, the extent of cleavage 
of a linear double-stranded template as a function of time was determined
for all 60 singly substituted sites under single-turnover conditions,
and pseudo-Michaelis–Menten parameters’? KM* and kcat*
were obtained from these data (Supplementary Figs 1-3). Comparison
of kcat*/KM* for related substrates highlights the high
sequence specificity of the enzyme: for example, at position −4
kcat*/KM* for the wild-type G:C base pair is more than 2,000-fold
greater than for A:T and more than 400-fold greater than for C:G
(Fig. 1b and Supplementary Table 1; because specificity is determined
by the differences in kcat*/KM* for different substrates’’, these results
provide perhaps the most rigorous quantification of homing endonuclease
specificity so far). The contribution of target-site interactions
to ground-state stabilization (KM*, Fig. 1d) versus transition-state
stabilization (kcat*, Fig. 1e) was found to be skewed: substitutions
on the (−) side increased KM* significantly without reducing kcat*,
whereas substitutions on the (+) side decreased kcat* with little effect
on KM*. The overall segregation of the kinetic contributions to specificity
is shown graphically in Fig. 1f and in the structural schematic in
Fig. 1a: most single-base substitutions in the target affect kcat* (blue,
(+) side) or KM* (red, (−) side) but not both. The striking feature of
our results is that the apparent symmetry of the binding interface is 
completely broken during catalysis: chemically, very similar protein— 
DNA contacts are used for substrate association on the left side and 
selective transition-state stabilization on the right side. 
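
To make the kinetic parameters concrete, the Python sketch below fits kcat* and KM* to rate-versus-enzyme-concentration data and compares two substrates by their kcat*/KM* ratio. It assumes a simple saturation form, k_obs = kcat*·[E]/(KM* + [E]), and uses a Hanes-Woolf linearization; neither the functional form nor the fitting method is stated in the text, and all numbers in the usage note are invented.

```python
def saturation_rate(enzyme_nM, kcat, km_nM):
    """Assumed single-turnover rate law: k_obs = kcat* [E] / (KM* + [E])."""
    return kcat * enzyme_nM / (km_nM + enzyme_nM)

def fit_hanes_woolf(enzyme_nM, rates):
    """Least-squares line through ([E], [E]/k_obs).

    For the assumed rate law, slope = 1/kcat* and intercept = KM*/kcat*.
    Returns (kcat, km).
    """
    xs = list(enzyme_nM)
    ys = [e / k for e, k in zip(xs, rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kcat = 1.0 / slope
    return kcat, intercept * kcat

def specificity_ratio(kcat_a, km_a, kcat_b, km_b):
    """Ratio of kcat*/KM* for substrate a over substrate b."""
    return (kcat_a / km_a) / (kcat_b / km_b)
```

With noise-free rates generated from `saturation_rate`, `fit_hanes_woolf` recovers the input parameters. A substitution that leaves kcat* unchanged but raises KM* 2,000-fold gives a `specificity_ratio` of 2,000 relative to the wild-type site.
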

Surface display methods are widely used to engineer proteins with 
new binding specificities'*. The sequence specificity profile obtained 
for singly substituted target sites binding to I-AniI displayed on the
surface of yeast closely parallels the profile observed in the solution
fluorescence experiments (Fig. 1c) (J.J., B.L.S. and A.M.S., unpublished
observations). We reasoned that cleavage of mutated target sites with
increased KM* should be suppressible by tethering the DNA duplex
containing the target site adjacent to the displayed enzyme on the yeast
surface; the increase in local substrate concentration should compensate
for the decrease in ground-state binding affinity (Fig. 2b). Indeed,
mutations between positions −10 and −3 (red) that greatly reduced
binding in solution do not slow tethered cleavage on the yeast cell
surface (Fig. 2c). In contrast, substitutions on the right side of the target
Figure 1 | Segregation of contributions to binding and catalysis. a, Ribbon
diagram of the I-AniI enzyme in complex with the wild-type target site
(2QOJ"’). Target site and positions of DNA cleavage are shown below: (−)
side cleavage site is cut before (+) side site’®. b-e, Colour scheme: A, green;
C, blue; G, yellow; T, red; error bars, s.e.m. b, kcat*/KM* values for the wild-
type target site (red star) and each of the 60 singly substituted target sites
(vertical bars). Substitutions throughout the length of the target site
abrogate enzyme activity, demonstrating the high sequence specificity of the
enzyme. c, Relative binding affinities determined for each singly substituted
target site using fluorescence competition assays. Substitutions on the left
side, but not the right side, significantly reduce binding affinity. d, KM* values
for each singly substituted target site relative to the wild type. As in
c, substitutions on the left but not the right display significantly different
values from wild type. e, kcat* values for each singly substituted target site
relative to the wild-type site. In contrast to c and d, substitutions between


site (blue) that reduced cleavage in solution also reduced enzyme 
activity in the tethered cleavage assay, consistent with their reduction 
of kcat*. Substitutions that disrupt interactions involved in selective
transition-state stabilization cannot be overcome by increasing the 
local concentration of substrate. 
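
The tethering argument can be written as a short worked limit. This is a sketch under the assumption that the observed single-turnover rate follows a simple saturation form in an effective local concentration; that form and the symbol c_eff are illustrative and are not taken from the text.

```latex
% Assumed saturation form for the observed single-turnover rate
k_{\mathrm{obs}} = \frac{k_{\mathrm{cat}}^{*}\, c_{\mathrm{eff}}}{K_{M}^{*} + c_{\mathrm{eff}}}

% Tethering makes the effective local concentration large: c_eff >> K_M^*
k_{\mathrm{obs}} \;\longrightarrow\; k_{\mathrm{cat}}^{*}
\qquad \text{as } c_{\mathrm{eff}}/K_{M}^{*} \to \infty
```

In this limit a substitution that only raises KM* leaves the tethered rate unchanged as long as c_eff remains well above the new KM*, whereas a substitution that lowers kcat* lowers the tethered rate in proportion.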

Assuming the simple free-energy diagram in Fig. 2a, we can make 
inferences from the kinetic data in solution and on the yeast surface 
about the structures of the Michaelis and transition-state complexes. 
Side-chain–base-pair interactions from positions −10 to −5 are present
in both the Michaelis complex and the transition state (base
substitutions increase KM* and KD in solution and do not affect kcat*
positions −4 and +9 have significant effects. Substitutions for which
KM* was too high (>750 nM) to allow separate determination of kcat* and
KM* are indicated by bars with dashed lines in d, and are left blank in
e. f, Asymmetry of the contributions to kcat* and KM*. Positions shown in
red are on the left (−) side of the target site from −10 to −5 and almost
exclusively contribute to KM*. Positions shown in blue are on the right (+)
side of the target site from positions +3 to +7. The boundary positions, −4,
−3 and +6, contribute to both kcat* and KM* and are shown in yellow. To
portray the structural context of these positions, the target site in a is
coloured based on the effect of the mutation on kcat*, normalized by the sum
of the effects on kcat* and KM* (|Δln(kcat*)| / (|Δln(KM*)| + |Δln(kcat*)|):
close to 1.0, blue; close to 0.0, red; intermediate, yellow; position where
KM* and kcat* could not be separately determined, grey).
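
The colouring rule in panel f can be expressed as a small Python function. This is a sketch: the cut-offs that separate 'close to 0.0', 'intermediate' and 'close to 1.0' are not given in the caption, so the thresholds below are hypothetical, as are the function names.

```python
import math

def kcat_fraction(kcat_wt, kcat_mut, km_wt, km_mut):
    """|dln(kcat*)| / (|dln(KM*)| + |dln(kcat*)|) for one substitution.

    Returns None when neither parameter changes.
    """
    d_kcat = abs(math.log(kcat_mut / kcat_wt))
    d_km = abs(math.log(km_mut / km_wt))
    total = d_kcat + d_km
    return d_kcat / total if total else None

def colour(fraction, low=1 / 3, high=2 / 3):
    """Map the fraction to a colour; low and high are assumed cut-offs."""
    if fraction is None:
        return "grey"        # parameters could not be separated
    if fraction >= high:
        return "blue"        # effect mostly on kcat*
    if fraction <= low:
        return "red"         # effect mostly on KM*
    return "yellow"          # both parameters affected
```
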


or the rate when tethered). Sequence-specific base-pair interactions 
from +3 to +8 are formed only in the transition state (substitutions 
have no effect on KM* or KD, reduce kcat*, and slow the rate when
tethered). A third class of interactions (at −5 and +7 for example)
appear to be formed in the Michaelis complex but not the transition
state (substitutions increase or decrease both kcat* and KM*/KD).
Importantly for the design calculations described in the next sec- 
tion, three observations suggest that the crystal structure of the com- 
plex likely resembles the transition state more than the Michaelis 
complex: (1) specific interactions on the (+) side of the DNA target 
present in the crystal structure appear to be formed in the transition 
Figure 2 | Contributions to catalysis. a, Free-energy diagram showing the
effect of target-site substitutions on the free energies of substrate binding and
transition-state stabilization. ETS, enzyme-transition-state complex; ES,
enzyme-substrate complex; E+S, free enzyme and substrate; E+P, free
enzyme and product; ΔG, free energy difference. Most substitutions on the
left side increase KM*, and hence the free energy difference between ES and
E+S, but leave kcat*, and hence the free energy difference between ETS and
ES, unchanged. These substitutions thus seem to disrupt interactions made in
both states. Most substitutions on the right side instead leave KM* unchanged
but decrease kcat*, and hence raise only ETS, suggesting they remove
interactions present in ETS but not ES. A small subset of positions (labelled 
boundary) appears to stabilize or destabilize ES selectively while not affecting 
ETS; these substitutions may disrupt interactions present in ES but not in 
ETS. b, Free-energy profiles for a free (left) and tethered (right) system. Red 
profile: substitutions that remove interactions present in both ES and ETS; 


state but not the Michaelis complex; (2) the third class of substitu- 
tions mentioned above that appear to stabilize only the Michaelis 
complex make few interactions in the crystal structure (Sup- 
plementary Fig. 4); and (3) calculations of Rosetta specificity based 
on the crystal structure correlate better with catalytic efficiency than 
with binding affinity (Supplementary Fig. 5). 

Monomeric LAGLIDADG homing endonucleases, which recog- 
nize non-palindromic targets, are attractive scaffolds for genome 
engineering applications’*. An important challenge is to reprogram 
the substrate specificity of these enzymes towards desired target 
sequences’. To redesign I-AniI specificity using Rosetta’®, the target
site in the crystal structure of the I-AniI protein-DNA complex is
mutated in silico and the program searches for combinations of 
amino-acid substitutions that allow the formation of energetically 
favourable interactions with the new base pairs, but not with the 
wild-type base pairs'®. Design calculations were performed for six 
target-site variants bearing single base-pair substitutions, genes 
encoding the amino-acid sequences of eight redesigned enzymes 
were constructed and the enzymes purified. DNA cleavage assays 
revealed that the designed specificity changes were for the most part 
achieved (Supplementary Table 2). These results demonstrate that 
I-AniI cleavage specificity can be reprogrammed by computational
protein design, thereby providing starting points for the larger-scale 
specificity changes required to cleave physiological target sites. 

An enzyme redesigned for a new target site could achieve altered 
specificity either by changing kcat*, changing KM* or changing both.
To determine whether the designed changes in specificity were a
result of changes in KM* or kcat*, for each of eight designed endonucleases
we measured the single-turnover cleavage kinetics for target
substrates containing each of the four possible base pairs at the
redesign position (Supplementary Table 2). A design aimed at specific
recognition of a DNA target site containing base pair −8G:C
(Fig. 3a) achieved specificity exclusively by modulating KM*: KM*
decreased for G:C, and increased for A:T, T:A and C:G. In contrast, a
design aimed at specific recognition of +8C:G (Fig. 3b) achieved
specificity entirely through kcat*: kcat* decreased for A:T, G:C and
T:A, but was unchanged for +8C:G. Both of these designed enzymes
have high specificity at neighbouring base pairs, and overall specifi- 
cities that are higher than the wild-type enzyme in the targeted 
regions (Supplementary Fig. 6). A design aimed at specific recog- 
nition of the —3C:G substitution (Fig. 3c), at the boundary between 
blue line, substitutions that remove interactions present in ETS but not ES.
Tethering increases the free energy of free E + S to the point that the rate 
depends only on the free-energy difference between ES and ETS. Because this 
free-energy difference is unchanged by substitutions that remove interactions 
made in both ES and ETS (red profile), they do not affect the rate in the 
tethered case. c, Cleavage kinetics of endonuclease tethered to yeast cells. 
Surface displayed enzyme cleaves a tethered fluorescently labelled 
oligonucleotide, which then diffuses away from the yeast surface resulting in 
loss of fluorescence. Black, wild-type target site; random DNA, grey; shades of 
red, left-side target-site substitutions; shades of blue, right-side target-site 
substitutions. Tethering suppresses decreases in cleavage rate produced by 
(−) side but not (+) side mutations. The rate increases observed for some (−)
side variants such as −5C were consistent across multiple independent
experiments and with the in vitro results in Fig. 1e.


KM*- and kcat*-influencing positions (Fig. 1), displayed changes in
both kcat* and KM*, consistent with the results with the wild-type
enzyme at this position. These trends hold for the remaining designs 
as well (Supplementary Table 2 and Supplementary Fig. 7): we find 
generally that the left-side designs achieve specificity primarily by 
modulating ground-state binding affinity, whereas the right-side 
designs achieve specificity by modulating the stability of the transition 
state. 

Our results suggest that initial binding of I-AniI to its target site
involves formation of base-specific interactions on the (−) side and
lower-affinity non-specific interactions on the (+) side to form the
Michaelis complex (the latter are suggested by yeast display experiments
which show that the enzyme binds less tightly to the (−) half-site
than to the full site (J.J., B.L.S. and A.M.S., unpublished observations)).
Catalysis then requires bending of the DNA (note bend in
Fig. 1a), which is stabilized at the transition state by newly formed
specific interactions between the (+) side and the enzyme. Such a 
two-stage mechanism (see Supplementary Information section C) 
may be a general solution to the problem of specific target-site recog- 
nition by enzymes that act on distorted DNA substrates. If the 
enzyme only bound to the distorted site, binding would require 
enzyme to be at the site (which may occur only once in the genome) 
simultaneous with fluctuation of the DNA into the distorted con- 
formation; because both are rare events, the net rate of binding, the 
product of two small numbers, would be very slow. If, instead, the 
enzyme can bind with some sequence specificity to undistorted target 
sites, the probability of being close enough to capture (and perhaps 
promote) fluctuations that distort the DNA will be very much higher. 
In I-AniI the total transition-state binding energy appears to be
roughly divided between the two steps: the N-terminal domain 
guides the enzyme to potential target sites that match on the (—) 
side, and the C-terminal domain specifically stabilizes the transition 
state if there is also a match on the (+) side. 

There is considerable synergy between classical enzymology and 
modern computational design. Design should be informed by detailed 
analyses of the wild-type enzyme because, depending on the enzyme 
and substrate concentrations in the application the designed enzymes 
are to be used for, it may be necessary to re-engineer KM, kcat and/or
kcat/KM. Conversely, computational design can provide insight
into the basis for transition-state stabilization. Our kinetic dissection
of I-AniI cleavage activity also has implications for endonuclease
Figure 3 | Computational redesign of specificity. Colour scheme: A, green;
C, blue; G, yellow; T, red; error bars in right panels, s.e.m. a, Design for 
—8A:T to —8G:C substitution (K24N, T29K). Middle panel: the designed 
residues N24 and K29 make direct hydrogen bonds to —8G and —8C, 
respectively. Left panel: the concentration dependence of the cleavage 
activity for the designed enzyme (solid lines) for different base pairs at the 
—8 position differs considerably from the wild-type enzyme (dashed lines). 
Right panel: the kcat* values remain approximately the same for both the
wild-type and designed enzymes against all target sites, but the KM* values
are decreased for the target G base pair (arrow 3) and increased significantly 
for the other three substitutions (arrows 1, 2 and 4). b, Design for +8A:T to 
+8C:G substitution (L156Q, I164R, T204S). Middle panel: designed
residues R164 and Q156 make direct hydrogen bonds to +8G. Designed 


re-engineering using yeast display: selection based on binding may be 
sub-optimal because substrate binding could be optimized at the 
expense of transition-state stabilization, whereas selection for cleavage 
in the tethered substrate system could yield variants with decreased 
solution cleavage due to increased KM*. These pitfalls could potentially
be overcome by selecting both for kcat* and KM*, perhaps by
alternating between the two selection procedures. More generally, the 
union of classical enzymology with modern computational design and 
selection technology, as illustrated here, provides a powerful approach 
to revealing the mechanistic basis for, and subsequently reprogram- 
ming of, sequence-dependent molecular recognition. 


METHODS SUMMARY 

Experimental preparation and kinetic analysis. I-AniI was expressed in
Escherichia coli BL21(DE3) using a standard auto-induction protocol and puri- 
fied over a His-trap column. Linearized plasmid substrates were prepared for 





residue S204 holds R164 in position. The kinetic traces (left panel) and bar
graphs (right panel) show this design achieves altered specificity through
changing kcat*. The KM* values remain approximately the same for both the
wild-type and designed enzymes against all target sites, but the kcat* values
are significantly decreased for all of the competitor target sites (arrows 1, 2 
and 3). c, Design for —3G:C to —3C:G substitution (Y18W, E35K, R61Q). 
Middle panel: designed residues K35 and Q61 make a direct hydrogen bond 
to —3G and a water-mediated hydrogen bond to —3C, respectively. Q61 and 
K35 also hydrogen bond with each other, and designed residue W18 further 
helps position K35 through packing interactions. The kinetic traces (left 
panel) and bar graphs (right panel) show this design achieves altered 
specificity through changing both kcat* and KM*. The designed enzyme has
an increased kcat* (arrow 2) and decreased KM* for the −3C (arrow 1).


each of the 60 singly substituted target sites. Kinetic assays were performed over a 
20-fold range of enzyme concentrations (from 30 to 1,500 nM, depending on the 
substrate) with 5 nM DNA substrate, and analysed by agarose gel electrophoresis 
followed by integration of product and substrate band densities. The velocity 
versus enzyme concentration profiles were determined two to four independent 
times; reported kcat* and KM* values are the average of those determined from
the independent experiments. Fluorescence competition-binding assays were 
performed as previously described’’. 

Computational design. New target sequences were mapped on to the I-AniI-DNA
crystal structure (2QOJ'') and the Rosetta computational design methodology
was used to optimize the amino-acid sequence of the protein to maximize
affinity for the new site'®. The predicted specificity of the resulting protein
models for the desired target sequence was computed using Rosetta, and designs 
that were predicted to bind tightly and specifically were subjected to further 
optimization using flexible backbone protein design (Supplementary 
Methods). The tightest binding and most specific designs were again selected, 
and the designed amino-acid substitutions were removed one at a time. If no 
significant loss was predicted in either specificity or binding energy, the substitution
was removed from the design. The ‘-8G:C_A’ (K24N/T29K) and
‘-8G:C_B’ (K24N/T29Q) designs were generated instead using a genetic algo- 
rithm to optimize binding affinity and specificity simultaneously (Supplemen- 
tary Methods). Genes encoding the designed proteins were assembled from 
oligonucleotides, and the designed proteins were expressed, purified and assayed 
as described above. 
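
The one-at-a-time reversion step can be sketched as a simple pruning loop in Python. The scoring callable, the tolerance values and the substitution labels are all hypothetical; in the procedure described above the predictions come from Rosetta, which is not called here.

```python
def prune_substitutions(substitutions, score, tol_specificity=0.1, tol_binding=0.5):
    """Drop substitutions whose removal causes no significant predicted loss.

    score(subs) must return (specificity, binding_energy), where higher
    specificity and lower binding energy are better. Both tolerances are
    illustrative assumptions.
    """
    kept = list(substitutions)
    for sub in list(substitutions):
        trial = [s for s in kept if s != sub]
        spec_full, bind_full = score(kept)
        spec_trial, bind_trial = score(trial)
        no_spec_loss = spec_full - spec_trial <= tol_specificity
        no_bind_loss = bind_trial - bind_full <= tol_binding
        if no_spec_loss and no_bind_loss:
            kept = trial     # the substitution is not needed; revert it
    return kept
```

Each substitution is tested against the current, already pruned design, so the result can depend on the order in which substitutions are tried.
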

Tethered cleavage on yeast surface. PCR-generated DNA substrates, labelled with 
biotin and Alexa 647, were tethered by an antibody–streptavidin–phycoerythrin 
bridge to the haemagglutinin epitope of I-AniI expressed on the surface of S. 
cerevisiae in conditions that prohibited catalysis. Samples were then spiked with 
10 mM MgCl2 and placed in a pre-warmed chamber at 37 °C, and fluorescence 
measurements were acquired on a flow cytometer. The Alexa 647 signal from a 
phycoerythrin-normalized population of each sample was then plotted against 
time to generate the curves shown. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS 

Protein expression and purification. Genes encoding Y2 I-AniI designs were 
assembled from oligonucleotides, cloned into a variant of the pET15 expression 
vector, and sequence-verified plasmids were transformed into BL21 Star 
(Invitrogen). A 1-l culture of auto-induction media was inoculated with several 
colonies and grown at 37 °C for about 12 h (to approximate saturation), and 
expression was continued at 18 °C for about 24 h. Cells were harvested, resuspended 
in Tris 20 mM pH 7.5, 1.0 M NaCl and 30 mM imidazole, and lysed by sonication 
and lysozyme. The soluble fraction was loaded onto a 1 mL HisTrap FF crude 
column (GE Healthcare) and I-AniI variants were purified by imidazole gradient 
elution on an ÄKTA express (GE Healthcare). The proteins were concentrated 
and the buffer was exchanged to Tris 20 mM pH 7.5, 500 mM NaCl and 50% (v/v) 
glycerol for storage. Purity of the proteins was assessed by SDS–polyacrylamide gel 
electrophoresis, and the concentration of samples with >95% purity was 
determined by measuring the absorbance at 280 nm using the calculated extinction 
coefficient. The concentration of enzyme in the <95% pure samples was 
determined by generating a standard curve with a pure I-AniI protein, correlating 
protein concentration with band density (calculated with ImageJ 
(http://rsbweb.nih.gov/ij/)) and comparing the band density of the I-AniI protein in impure 
samples run on the same gel as the standard curve. 

Plasmid substrate construction. All single base-pair variants from the wild-type 
target site in pBluescript were individually constructed by site-directed mutagenesis 
as described. Sequence-verified plasmids were linearized with ScaI 
before the kinetic assays to facilitate product identification. 

Endonuclease activity assays. Kinetic assays. Previous work has confirmed 
that I-AniI, like other LAGLIDADG endonucleases, is a single-turnover enzyme, 
and the conditions for single-turnover kinetics were met in all experiments. 
The ionic strength of the enzyme reaction buffer was optimized for enzyme 
activity and stability to a final solution of 170 mM KCl, 10 mM MgCl2 and 
20 mM Tris, pH 9.0. Enzyme was diluted in 1.25× reaction buffer to working 
concentrations, serial twofold dilutions were made, and both substrate plasmid 
and diluted enzyme were incubated separately at 37 °C for 1 min. The appropriate 
amount of plasmid (one-fifth of the reaction volume) was added to each 
reaction for a final 1× reaction buffer and final plasmid concentration of about 
5 nM (the lowest concentration still readily visible on agarose gel). The plasmid 
(one-fifth of reaction volume) was added to the enzyme (four-fifths of reaction 
volume) to minimize heat loss during the transfer (which was found to add 
significant noise to the data). Reactions were halted with 200 mM EDTA, 30% 
glycerol and bromophenol blue. DNA fragments were separated on 1.2% agarose 
TBE gels, which were then stained in a standard ethidium bromide solution and 
subsequently destained in water for maximum contrast between DNA and back- 
ground. All data were collected by integrating the density of the substrate 
(2,959 bp) and product bands (1,801 bp and 1,158 bp) using ImageJ 
(http://rsbweb.nih.gov/ij/). The percentage of product formed is equal to the sum of 
the density of the two product bands divided by the total sum of the densities of 
the three bands. The progress curves fit to single exponentials for all enzyme 
concentrations (Supplementary Fig. 1) and for all target sites except for several 
substitutions in the central four base pairs between the cleavage sites on the two 
DNA strands (Supplementary Fig. 2). 
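
The quantification described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' analysis code: the function names are hypothetical, the plateau of the progress curve is assumed to be known (1.0 by default), and the rate constant is estimated with a simple linearized least-squares fit rather than whatever fitting routine was actually used.

```python
import math


def fraction_product(substrate, product_a, product_b):
    """Fraction of DNA cleaved, from the three integrated band densities."""
    total = substrate + product_a + product_b
    if total <= 0:
        raise ValueError("band densities must sum to a positive value")
    return (product_a + product_b) / total


def fit_single_exponential(times, fractions, plateau=1.0):
    """Estimate k_obs in P(t) = plateau * (1 - exp(-k_obs * t)).

    Assumes the plateau is known. Fits a line through the origin to
    -ln(1 - P/plateau) versus t by least squares.
    """
    num = 0.0
    den = 0.0
    for t, p in zip(times, fractions):
        if t <= 0 or p >= plateau:
            continue  # the log transform is undefined or uninformative here
        y = -math.log(1.0 - p / plateau)
        num += t * y
        den += t * t
    if den == 0:
        raise ValueError("need at least one usable time point")
    return num / den
```

For example, `fraction_product(50, 30, 20)` treats a lane whose substrate band holds half of the total density as 50% cleaved. The log-linear fit is chosen only because it needs no external libraries; a nonlinear fit would weight noisy late time points more sensibly.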

Assays for specificity positions adjacent to designed nucleotide. Twofold serial 
dilutions of enzyme from 1,500 to 11 nM were made in 1.25× reaction buffer and 
the enzyme was reacted with about 5 nM substrate (in 1× reaction buffer) for 
half an hour at 37 °C. Reactions were halted and data were analysed as described 
in the ‘Kinetic assays’ section. 
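
The dilution arithmetic in these assays can be checked with a short Python sketch. The helper names are hypothetical, and the number of dilution points (eight) is an inference from the stated 1,500 to 11 nM range, not a figure given in the text.

```python
def twofold_series(start_nm, n_points):
    """Concentrations (nM) of a serial twofold dilution, highest first."""
    return [start_nm / 2 ** i for i in range(n_points)]


def final_buffer_strength(stock_strength, volume_fraction):
    """Buffer strength after the stock makes up `volume_fraction` of the mix."""
    return stock_strength * volume_fraction


# Eight points (an assumed count) span 1,500 nM down to about 11.7 nM.
series = twofold_series(1500, 8)

# Enzyme in 1.25x buffer at four-fifths of the reaction volume gives 1x buffer.
final = final_buffer_strength(1.25, 4 / 5)
```

The second calculation shows why the enzyme was diluted in 1.25× buffer: adding plasmid as the remaining one-fifth of the volume brings the buffer to 1×.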

Fluorescence competition binding assay. Unlabelled DNA oligonucleotides 
with each of the 60 single base-pair substitutions in the I-AniI target site (wild-type 
I-AniI site, 5’-TGAGGAGGTTTCTCTGTAAG-3’), a negative control 
sequence (5’-CTCTTCTTGCATATATCTCC-3’), an unlabelled wild-type site 
oligonucleotide and a wild-type site oligonucleotide labelled with 5’ Cy3 were 
synthesized with six consecutive ‘A’ flanking on each end (Integrated DNA 
Technology, 100-nmol scale, salt-free). Complementary oligonucleotides were 
ordered for all 63 sites and double-stranded target DNA was prepared by anneal- 
ing equal amounts of complementary strands. 

His-tagged I-AniI was immobilized by incubating 200 µl of 100 nM I-AniI in 
TBS/BSA buffer (50 mM Tris-HCl (pH 7.5), 150 mM NaCl, 0.2% BSA) in wells 
of Nickel-NTA coated HisSorb Plates (Qiagen) for 2 h at room temperature. 
Unbound protein was removed and the plates were washed four times with TBS/ 
Tween-20 (50 mM Tris-HCl (pH 7.5), 150 mM NaCl, 0.05% Tween-20). The 
immobilized I-AniI in the microtitre plate was incubated for about 4 h with both 
100 nM labelled target DNA duplex and 3 µM (30-fold excess) of one unlabelled 
duplex per well in 200 µl of binding buffer (50 mM Tris-HCl (pH 7.5), 150 mM 
NaCl, 0.02 mg ml−1 poly(dI-dC), 10 mM CaCl2). The plates were washed four 
times with TBS (50 mM Tris-HCl (pH 7.5), 150 mM NaCl), and the fluorescent 
signal retained in each well was quantified using a SpectraMax M5/M5e 
Microplate Reader (Molecular Devices) (excitation, 510 nm; emission, 565 nm; 
cutoff, 550 nm). Additional negative control experiments performed in the 
absence of the enzyme indicated that no significant detectable fluorescent signal 
was retained after the protocol described above was completed. Relative binding 
affinities were calculated using the following equation: relative binding 
affinity = [(F_n − F_s) × F_w]/[(F_n − F_w) × F_s], where F_s, F_w and F_n indicate 
fluorescent intensities obtained from wells in which the immobilized protein was 
incubated with the unlabelled singly substituted target sites, wild-type target site 
and negative control sequence, respectively. 
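
As a worked illustration, the relative binding affinity can be computed as below. The printed equation is damaged in this copy, so the sketch assumes the competition form [(F_n − F_s) × F_w]/[(F_n − F_w) × F_s], with F_s, F_w and F_n the signals for the substituted site, wild-type site and negative control; the function name and the example intensities are hypothetical.

```python
def relative_binding_affinity(f_sub, f_wt, f_neg):
    """Affinity of a substituted site relative to the wild-type site.

    Assumed form: [(F_n - F_s) * F_w] / [(F_n - F_w) * F_s].
    A competitor that binds well displaces labelled DNA and lowers the
    retained signal, so (F_n - F) / F grows with competitor affinity.
    """
    denominator = (f_neg - f_wt) * f_sub
    if denominator == 0:
        raise ValueError("wild-type and negative-control signals must differ, "
                         "and the substituted-site signal must be non-zero")
    return (f_neg - f_sub) * f_wt / denominator


# Made-up intensities: negative control 1000, wild type 200, variant 500.
example = relative_binding_affinity(f_sub=500.0, f_wt=200.0, f_neg=1000.0)
```

Under this form the wild-type site scores exactly 1 and a site that competes no better than the negative control scores 0, which matches the intended normalization.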

Cleavage kinetics of endonuclease tethered to yeast cells. Surface display of I-AniI 
on S. cerevisiae was performed using standard methods". For each sample, 5 X 10° 
cells were stained with biotinylated anti-HA11, followed by secondary staining with 
streptavidin-DNA substrate conjugates (1:3 molar ratio) on ice and in the absence 
of divalent cations. DNA substrates were generated by PCR using biotinylated and 
Alexa 647-conjugated primers complementary to the 5’ and 3’ sequences flanking 
the I-AniI target (or indicated target-site variants). PCR products were purified by 
ExoI digestion followed by size exclusion chromatography on a G-100 column (GE 
Healthcare) before conjugation with streptavidin–phycoerythrin. Samples were 
then spiked with 10 mM MgCl2 and placed in a pre-warmed 37 °C chamber and 
acquired at an approximate event rate of 3,000 s−1 for 400 s on a BD FACSAria II 
flow cytometer. Processing was performed using FlowJo software (Treestar). Live 
cells were gated by forward- and side-scatter properties, and doublets and clumped 
cells were excluded on the basis of forward scatter area versus height linearity. The 
Alexa 647 signal from a phycoerythrin-normalized population of each sample was 
then plotted against time to generate the curves shown. 

Computational design methods. Single-state design (designs −9C, −8G_C, 
−6C, +5C and +8C). The computational design of homing endonuclease–DNA 
specificity was performed using the Rosetta design software in a manner 
specifically designed to predict new protein sequences that would bind with high 
affinity to novel DNA sequences. The prediction of designed proteins with 
novel interactions to substituted base pairs in the I-AniI recognition sequence 
was performed by mutation and Monte Carlo repacking of amino-acid side 
chains as described in Ashworth et al. 2006 (ref. 16). The template for the design 
calculations was the crystal structure of the I-AniI–DNA complex (Protein Data 
Bank accession number 2QOJ). Additionally, minor shifts of the protein backbone 
were modelled only in the vicinity of the designed region using a loop-rebuilding 
algorithm. The specificity of each hypothetical new protein 
sequence for the intended new DNA recognition sequence was calculated as 
the Boltzmann probability of the intended complex versus a partition function 
consisting of each base-pair possibility at the redesigned DNA base pair. After 
design, predicted protein sequences with the most favourable binding energy and 
highest predicted specificity were reverted position by position to the wild-type 
amino-acid sequence to identify (and revert) designed mutations that did not 
significantly contribute to the energy or specificity of the designed complex. 
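
The specificity measure described above can be sketched as follows. This is an illustration only, not Rosetta code: the function name, the dictionary of energies and the temperature factor kT = 1.0 (in the same arbitrary units as the energies) are all assumptions.

```python
import math


def boltzmann_specificity(energies, target, kT=1.0):
    """Boltzmann probability of the target base pair.

    `energies` maps each base-pair option at the redesigned position to the
    energy of the protein-DNA complex; lower energy is more favourable.
    kT = 1.0 is an assumed value, not one taken from the paper.
    """
    e_min = min(energies.values())  # shift energies for numerical stability
    weights = {bp: math.exp(-(e - e_min) / kT) for bp, e in energies.items()}
    return weights[target] / sum(weights.values())


# Hypothetical energies for the four base-pair options at one position.
example = boltzmann_specificity(
    {"A:T": 0.0, "T:A": 0.0, "G:C": 0.0, "C:G": 0.0}, target="G:C")
```

With four equally favourable base pairs the probability is 0.25, the value expected for a protein with no preference; it approaches 1 as the target complex becomes much lower in energy than the alternatives.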
Multi-state design (designs −8G_A and −8G_B). Two base-pair positions in 
the structure were computationally mutated to generate a partial match to a 
recognition site in the IL-2Rγ gene in a mouse model of severe combined 
immunodeficiency disease. Specifically, positions −9G:C and −8A:T were 
modelled as −9A:T and −8G:C. A multistate design calculation was performed 
to select amino acids at positions 24Z, 26Z, 27Z and 29Z. Three states were 
included in the design. The first state was the target state, which was modelled 
using the altered DNA structure. The second state was the original structure with 
the wild-type DNA sequence, which served as a competitor to enforce binding 
specificity of the selected proteins for the altered recognition site (negative design 
state). The third state was the modelled structure of the best single-state design 
for the target state with the modified DNA sequence; the energy associated with 
this state is a constant during the multi-state design procedure. It represents the 
best scoring protein-altered DNA complex as assessed with the Rosetta energy 
potential, and it is therefore impossible for the energy associated with the target 
state to be lower than this value. As a result, multiple calculations were 
performed which differed from each other only in an artificial offset applied 
to the third state. Progressively larger offsets bias the calculations to select 
sequences that achieve higher specificity for the first state over the second state 
at the expense of achieving Rosetta scores that are allowed to be progressively 
worse than the third state. 

©2009 Macmillan Publishers Limited. All rights reserved 

doi:10.1038/nature08508 

A genetic algorithm was used to evolve a population of sequences that prefer the 
target state to the two competitors. An initial population of 2,000 sequences was 
generated by selecting random amino acids at the four design positions. The side-chain 
conformations of these four residues (with the rest of the protein and DNA 
structure held fixed) were predicted for the first and second states using a Monte 
Carlo algorithm, and the Rosetta score recorded. As noted above, the energy of the 
third state is a constant. A ‘fitness’ score for each sequence i in the population is 
calculated: fitness_i = E_target − ⟨E_competitors⟩, where E_target is the energy of the 
target state and the angled brackets denote an ensemble (Boltzmann-weighted) 
average over the energies of the competitors. Conceptually, the fitness corresponds 
to the transfer free energy of the protein from the ensemble of competitors 
to the target state. Subsequent generations were constructed using the following 
procedure. First, the sequence with the best (lowest) fitness was promoted automatically. 
Next, 1,980 sequences were created by recombining two members of the 
population using uniform crossover of two parents chosen by tournament selection. 
Finally, the remaining 19 sequences were generated by mutating a single 
parent chosen by tournament selection with a 25% chance of randomizing each 
position in turn. A fitness value was calculated for each new sequence, and the 
population was propagated for 30 generations. 
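
The generation scheme above (one elite, 1,980 crossover children, 19 mutants, 30 generations) can be sketched in Python. This is a hedged illustration, not the authors' implementation: the energy function is a hypothetical stand-in for Rosetta scoring, and the tournament size of 2, kT = 1.0 and all function names are assumptions.

```python
import math
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
N_POSITIONS = 4  # four design positions, as in the text


def boltzmann_average(energies, kT=1.0):
    """Boltzmann-weighted mean energy; kT = 1.0 is an assumed value."""
    e_min = min(energies)  # shift by the minimum for numerical stability
    weights = [math.exp(-(e - e_min) / kT) for e in energies]
    return sum(w * e for w, e in zip(weights, energies)) / sum(weights)


def fitness(seq, energy_fn, third_state_energy):
    """fitness = E_target - <E_competitors>; lower is better."""
    competitors = [energy_fn(seq, "wild_type"), third_state_energy]
    return energy_fn(seq, "target") - boltzmann_average(competitors)


def tournament(population, scores, rng, size=2):
    """Fittest of `size` randomly drawn members (the size is an assumption)."""
    picks = rng.sample(range(len(population)), size)
    return population[min(picks, key=lambda i: scores[i])]


def next_generation(population, scores, rng, n_cross, n_mut, p_mut=0.25):
    """One elite, n_cross uniform-crossover children and n_mut mutants."""
    elite = population[min(range(len(population)), key=lambda i: scores[i])]
    new_pop = [elite]
    for _ in range(n_cross):
        a = tournament(population, scores, rng)
        b = tournament(population, scores, rng)
        # Uniform crossover: each position comes from either parent.
        new_pop.append("".join(rng.choice(pair) for pair in zip(a, b)))
    for _ in range(n_mut):
        parent = tournament(population, scores, rng)
        # Each position is randomized with probability p_mut.
        new_pop.append("".join(
            rng.choice(AMINO_ACIDS) if rng.random() < p_mut else aa
            for aa in parent))
    return new_pop


def evolve(energy_fn, third_state_energy, pop_size=2000, generations=30, seed=0):
    """Run the genetic algorithm and return (best sequence, best fitness)."""
    rng = random.Random(seed)
    population = ["".join(rng.choice(AMINO_ACIDS) for _ in range(N_POSITIONS))
                  for _ in range(pop_size)]
    n_mut = 19
    n_cross = pop_size - 1 - n_mut  # 1,980 when pop_size is 2,000
    for _ in range(generations):
        scores = [fitness(s, energy_fn, third_state_energy) for s in population]
        population = next_generation(population, scores, rng, n_cross, n_mut)
    scores = [fitness(s, energy_fn, third_state_energy) for s in population]
    best = min(range(len(population)), key=lambda i: scores[i])
    return population[best], scores[best]


def toy_energy(seq, state):
    """Hypothetical stand-in for a Rosetta score: lysines favour the target."""
    return -float(seq.count("K")) if state == "target" else 0.0
```

Calling `evolve(toy_energy, third_state_energy=-4.0)` runs the full 2,000-sequence, 30-generation schedule on the toy problem. The artificial offset described in the text would enter as a shift added to `third_state_energy` between runs.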
Ian Anderson, an ecologist at the University of 
Western Sydney, Australia, has won the first 
annual ProSPER.NET-Scopus Young Scientist 
award for agriculture and natural resources. 


Do you think this award 
will change your career? 
Yes, I expect it will have an 
impact on my career. Awards 
such as this can be used as a 
springboard for applications 
for funding, promotions or 
new job opportunities. In 
particular, the ProSPER.NET-Scopus 
award is for 
scientists younger than 40, 
which is extremely important 
given that science is such a 
difficult game in which to get 
permanent positions. 


What is the biggest 
challenge for today's 
young scientists? 
It is simply getting out of 
the postdoc cycle and into 
a permanent faculty or 
full-time position. There’s 
not an excess of funding 
in any country, so it is very 
competitive to get funding 
and there are limited 
opportunities for permanent 
employment. 


Do you have a research 
strategy to further your 
career? 
My tactic up to this point 
has been to balance 
some risky, technically 
challenging experiments 
that push the boundaries 
with more straightforward 
research. I consider the 
development of molecular-biology 
techniques, such as 
DNA fingerprinting of soil 
microbes, to be my greatest 
scientific achievement. It 
has opened up new avenues 
of research, helping to 
construct a picture of species 
distributions underground. 
But I balance that work with 
other experiments in which 
I have a more direct idea of 
where they will lead so that 
I can make the currency 
– publications and grant 
income – necessary for a 
young scientist to get the 
elusive faculty position. 


Has ‘sustainability’ driven 
your research career? 

In the early days of my 
career, I was purely 
interested in the ecology 
of a system or organism. 
Having studied these 
natural forest ecosystems 
for the past few years, I 
began to see the broader 
context of how organisms 
fit into ecosystems and into 
the planet in general. So, 
although I may not have 
realized it straight away, 
my work on soil fungal 
functions in nutrient 
and carbon cycling has 
sharpened my focus on 
sustainability of ecosystems 
from a biodiversity and 
conservation point of view. 


Is there a single question 
that could sustain your 
research career for years? 
For the next 10 years, the 
big challenge for me will be 
turning my research focus 
to climate change. I want to 
understand how these soil 
microorganisms are going to 
respond to climate change 
and what their role is in 
adaptation to and potential 
mitigation of climate change. 
We are really just starting to 
understand how soil fungi 
work, so there is a huge 
amount of fundamental 
ecology to do on the 
organisms themselves. At the 
broadest level, we need to 
understand the basic biology 
of organisms in order to 
address bigger issues such as 
climate change. 


Interview by Virginia Gewin 


POSTDOC JOURNAL 


The many hats of science 


On my first day of graduate 
school, excitement and 
anticipation consumed me. 
I have to admit – I was naive. 
I thought learning how to 
conduct research effectively 
would be the only necessary 
scientific training. My 
experiments monopolized my 
thoughts, and my emotions 
often found themselves 
intertwined with the pending 
results. I remember going over 
protocols in my head while 
lying in bed at night to make 
sure that crucial steps had not 
been overlooked. 

However, I have learned 
that a career in science is 
much more than conducting 
experiments. Scientists wear 
many hats. They design 
controlled experiments, but 
they also assume the role 
of public speaker, writer, 
manager and mentor. 

As I consider how my time as 
a graduate student, and now 
as a postdoc, has prepared 
me for a future in science, 
I think about my ability to 
wear these many hats. Over 
the years I have given many 
presentations. I've come to 
enjoy writing manuscripts 
and editorial articles. And I 
am working on improving my 
managerial and mentoring 
skills. In overseeing a project 
in the lab to map the locations 
of hundreds of transcription 
factors in the yeast genome, 
for example, I've learned how 
to effectively guide others who 
are dealing with experimental 
roadblocks. 

If skills are hats that 
scientists wear, then I've got 
to collect them all. It's part of 
being a scientist – and it gives 
you an edge in an increasingly 
competitive job market. 
Bryan Venters is a postdoctoral 
fellow at the Center for 
Eukaryotic Gene Regulation 
at Pennsylvania State 
University, University Park. 
Women honoured 


This year’s L’Oréal-UNESCO Awards 
for Women in Science have gone to five 
candidates. Two won for their work on 
toxins: microbiologist Alejandra Bravo 
of the National Autonomous University 
of Mexico in Mexico City and biochemist 
Lourdes Cruz of the University of the 
Philippines Diliman in Quezon City. Two 
awards were given for work on diseases: 
to zoologist Rashika El Ridi of Cairo 
University and cell biologist Elaine Fuchs 
of the Rockefeller University in 
New York. The fifth award went to Anne 
Dejean-Assémat, a molecular biologist at 
the Pasteur Institute in Paris, for her work 
on leukaemia and liver cancers. Each 
laureate will receive US$100,000. 


Geoscientist shortfall 


A report from the US National Science 
Foundation (NSF) predicts that there 
will be a growing national need for 
geoscientists to address the problems 
of climate change, resource depletion, 
energy sustainability and environmental 
degradation. GEO Vision: Unraveling 
Earth’s Complexities through the 
Geosciences, released on 15 October 
by the NSF’s Advisory Committee for 
Geosciences, says geoscientists will 
increasingly be called on to assess how 
human behaviour is affecting Earth and 
its systems. Tim Killeen, NSF assistant 
director for geosciences, predicts that the 
4,000 US geoscientists who graduate each 
year will not be enough to supply these 
needs. 


Burnham expands south 


The Burnham Institute for Medical 
Research in La Jolla, California, has 
opened a new facility in Orlando, Florida. 
The centre, called Burnham at Lake Nona, 
will employ some 250 research scientists 
focusing on cardiovascular pathobiology 
and metabolic signalling and disease. 
Scientific director Daniel Kelly says that 
the facility is recruiting 30 principal 
investigators to lead teams of three to 
five researchers and postdocs. During 
the 8 October dedication of the centre, 
the University of Florida announced that 
it will build a research facility and drug-development 
centre at the new life-sciences 
complex. This centre will hire 15-20 
research scientists over the next three 
years, says Sobha Jaishankar, assistant vice-president 
for research at the university. 
Fertile grounds 


Can Brazil use its booming economy and abundant natural 
resources to become a [...]? Gene Russo finds out. 


Just north of central Rio de Janeiro, 
perched on a hilltop overlooking a 
busy road, is an ornate castle that 
seems out of place among the chaotic 
traffic and modest houses of the Manguinhos 
district. But this building (pictured above) 
and its surrounding 800,000-square-metre 
campus have had a key role in Brazil’s growing 
science enterprise. 

The Alhambra-inspired pavilion, built 
between 1905 and 1918, is the headquarters 
of the Oswaldo Cruz Foundation, commonly 
known as Fiocruz. Spearheaded by the 
Pasteur Institute-trained bacteriologist 
after whom it is named, the foundation 
helped develop vaccines to treat devastating 
outbreaks of bubonic plague, smallpox 
and yellow fever in Brazil. Cruz wanted a 
temple for science, an emblem people would 
recognize and respect. Funded by the federal 
government, Fiocruz is now not only central 
to Brazilian public health, it is also a major 
employer and trainer, with nearly 6,000 
researchers in Rio and several other satellite 
campuses around the country. 

Scientists and policy-makers hope that 
modern Brazil's temples of science will be 
its universities, government institutes and 
perhaps even its biotechnology companies. 

At times, Brazilian science has been 
hampered by a brain drain and modest 
private-sector endeavours — largely the result 
of limited venture capital and a culture that 
tends not to encourage entrepreneurs. But it 
could become a major player in international 
research if it can capitalize on its vast natural 
resources and booming economy. 

That promise comes in part from growing 
government support. A decade ago, science 
funding was erratic, says Luiz Antônio Barreto 
de Castro, the secretary of R&D Policies and 
Programmes at Brazil's Ministry of Science 
and Technology. “We didn’t know what the 
budget was going to be the next year,” he says. 
But by 2000, the congress had approved funds 
for future budgets, making it possible for 
science-funding agencies to plan long-term 
projects. Taxes on oil companies and other 
industries provided a steady source of support. 
Oil giant Petrobras, for example, must by law 
devote 1% of its revenue towards research and 
development. Although the situation is greatly 
improved, Barreto de Castro notes that federal 
R&D spending is still only slightly above 1% 
of the country’s gross domestic product. A 
government plan to increase that to 1.5% 
by 2010 is off track because of the recent 
financial downturn, he says. 


State support 
Fortunately for scientists, government 
money comes from the states as well. The Sao 
Paulo state government is by far the biggest 
contributor, sending 1% of all its tax revenue 
to science research — some US$400 million 
each year. This investment has helped the 
state economy to thrive, says Barreto de 
Castro. Some of the other 25 states are now 
attempting to follow suit. 

Still, for some, salaries and benefits are a 
concern. Hugo Armelin, a cell-signalling 
specialist at the state-funded Butantan 
Institute in Sao Paulo, laments the plight of 
his postdocs, saying that they tend to receive 
moderate pay but no benefits. “In Brazil, it’s 
easy to get fellowships for several years,” he 
says, “but without any benefits.” Postdocs, 
depending on their funding source, typically 
earn $24,000-30,000 per year. But staff 
scientists at Butantan Institute, which, like 
Fiocruz, focuses on vaccines and anti-venoms, 
earn only $20,500 per year when they start. 


NATURE|Vol 461/29 October 2009 

That can contribute to brain drain. Some 
fledgling Brazilian researchers who go abroad 
to do research or a postdoc choose to stay 
away because of more lucrative pay packages 
or more abundant opportunities. Fiocruz, for 
example, employs thousands of scientists, but 
often has 10 or 20 times more applications 
than it has vacancies. 

For years the Brazilian government has 
attempted a difficult balancing act: educating 
scientists and forging ties at more developed 
research institutions abroad, while trying 
to ensure that the bulk of the country’s 
science talent does not leave Brazil for good. 
Traditionally, the government has provided 
ample support for graduate study abroad. 
But today, such scholarships are more 
selective and generally target top universities, 
according to Barreto de Castro. 

To make domestic graduate education 
alternatives more attractive, especially in 
less developed regions, Brazil’s Ministry 
of Science and Technology has started 
a ‘northeastern biotechnology network’ 
programme meant to attract expertise to 
30 institutions and ensure that Sao Paulo, 
the country’s richest state, is not the only 
destination for science talent. Most of Brazil’s 
academic and private-sector life sciences 
research is concentrated in the richer areas 
such as Sao Paulo (see map). 

Started in 2004, the programme has 
received $15 million in the past five years. 
It allows students to earn their degree after 
stints at multiple institutions. Four hundred 
students receive needs-based funding 
and scholarships from federal and state 
governments. Each student's thesis draws on 
the research of multiple groups. In principle, 
the poorest states benefit most as they often 
don't have the money to fund biotechnology 
PhD programmes. The government 
continues to evaluate the programme’s 
effectiveness, says Barreto de Castro. 


Happy returns 

Many scientists who have returned to Brazil 
after an education or postdoc abroad have 
thrived. Armelin spent three years as a 
postdoc at the University of California, San 
Diego, before attending Harvard Medical 
School on a Guggenheim fellowship in 1982. 
But he decided to come back. “I realized 
I could be a nobody there or a somebody 
here,” he says, laughing. 

Mayana Zatz, now director of the human 
genome research centre at the University of 
Sao Paulo, did a postdoc at the University of 
California, Los Angeles. “I could have stayed,” 
she says. “I’m glad I didn’t.” Zatz has had a 
hand in building a thriving human genetics 
community at Sao Paulo, and has probably had 
a much greater impact than she would have 
had in the United States. However, she says, 
there were drawbacks. Although funding has 
not been a problem, reagents, for example, are 
costly and slow to arrive, a challenge for a fast- 
moving, competitive field such as genetics. 

The Brazilian government is also 
attempting to help spark private-sector 
science, although with mixed success. One 
federally funded programme offers money 
directly to fledgling businesses, including 
biotech firms. So far, the government has put 
up $200 million per year since 2006. Gerardo 
Mendoza, head of Bionext, a small biotech 
firm in Sao Paulo, says that this money, so far 
approximately $5.9 million for his firm, has 
been key to keeping his business afloat. 
Bionext develops biocellulose for potential 
use in surgical repair of the brain and heart. 
Most of Brazil's life-science companies focus 
on human health and agriculture (see 
graphic). Money might be available, says 
Mendoza, but the real problem is navigating 
the government bureaucracy to get approval 
for clinical trials. “It takes a very long 
time,” he says – often more than a year in his 
experience. 

[Graphic: Brazil's burgeoning life-sciences sector. Total number of 
companies: 253. Life-science companies by sector (legible entries): 
bioenergy (4.4%), mixed (8.8%), environment (8%), human health (30.8%). 
Map key: number of life-science companies (legible entry: 70-100).] 

[Photo caption] Gerardo Mendoza: bureaucracy 
is hampering progress. 

Yet another challenge across all life-science 
fields is moving university research into 
biotech spin-offs. “There are still some wrong 
views about venture capital and scientists and 
making money,” says Dario Grattapaglia, a 
forestry researcher at the Brazilian agricultural 
research corporation Embrapa, the science 
arm of Brazil's Ministry of Agriculture. 
Grattapaglia, who develops molecular 
breeding techniques to speed tree growth and 
select for different wood characteristics, says 
that Embrapa works with companies more 
often than does the typical university. He 
often collaborates with former students who 
now work at forest product companies. 

At Fiocruz, a centre for developing health 
technology has been established, with 
a new building slated for completion in 
2011 or 2012. “We're looking to establish 
something like incubators,” says Claude 
Pirmez, Fiocruz’s vice-president for research 
and reference laboratories. She anticipates 
that this will mean new products or 
perhaps research leading to clinical trials. 
But bureaucracies persist, and negotiating 
intellectual-property transfer can be difficult 
even at Embrapa. Grattapaglia fondly recalls 
his days at North Carolina State University 
where he completed his PhD, acquiring two 
patents based on his thesis paper alone. 

Nonetheless, university research can lead to 
promising biotech ventures. Alellyx Applied 
Genomics, for example, is a growing biotech 
born of successful genomics work in 2002. 
Alellyx co-founder Paulo Arruda had been a 
plant molecular biologist at Sao Paulo’s 
University of Campinas for 30 years. His 
foray into biotech began in 2000 after 
co-authoring a paper on the first genome 
sequence of a plant pathogen – which was also 
the first sequencing project led by Brazilian 
scientists (A. J. G. Simpson et al. Nature 406, 
151-157; 2000). A network of 25 labs sequenced 
the bacterium Xylella fastidiosa, which 
attacks citrus trees. Located in the suburb 
of Campinas outside Sao Paulo, Alellyx 
initially sprang up to find applications for 
the sequence. But, says Arruda, plans shifted 
in 2003 with the growing recognition of 
the opportunity presented by the rapidly 
expanding sugarcane-ethanol fuel industry. 
This, they calculated, was their route to 
profitability. Alellyx is hunting for genes 
to improve growth of different sugarcane 
varieties, and its next-door sister company, 
CanaVialis, breeds and markets different 
varieties. The two companies showed enough 
promise to catch the eye of US agribiotech 
giant Monsanto, which bought them in 2008 
for $290 million. But Alellyx is a relatively 
rare case. “The problem is we don’t have an 
entrepreneurship culture in students,” says 
Arruda, who suggests that universities should 
offer more business courses for science 
students and more internships at companies. 

But neither increasing the number 
of research opportunities nor boosting 
the role of the private sector seems an 
insurmountable obstacle for a country 
that just secured South America’s first- 
ever Olympic Games. If the industry- 
academia stigma recedes and salaries 
ascend, and native talent returns home, 
Brazilian life sciences will have plenty 
of promise — and probably plenty more 
temples of science. 
Gene Russo is Editor of Naturejobs. 


Correction 

The Feature ‘Rising star?’ (Nature 461, 832-833; 
2009) was incorrect in stating that the car-maker 
Toyota was named after its home city. In 
fact, the city was named after the car-maker. In 
addition, the photo on page 833 is of Yoshikazu 
Takeda not Keigo Takeda. 


It's the smells I find most evocative. They 
take me places, short-circuit the other 
senses. A hint of rose; elderflower in 
the garden. The river and its white- 
sanded estuary; the crashing of surf. 
Smoke from the soldering iron; the 
high-school physics lab. The soft- 
ness of her belly and a warm sum- 
mer’s evening. 

Soldering iron? 

Iblink my eyes, trying to focus on 
the swimming greenness around me. 
I am on a forest floor, dappled sunlight 
picking out — no, that’s not right. On a 
river bed perhaps, light refracting oddly 
and glinting off dust motes — that doesn’t 
work, either. Besides, I can breathe. The air, 
and it is definitely air, I think I would know 
if my lungs were full of water, is clean and 
sweet, but with a definite odour of elec- 
tronics. Dead electronics. 

My nose itches. I move my hand to — 
I try to move my hand to scratch it, but 
something isn't quite right. My arm won't 
move. I try to sit up, see what’s happening: 
but my body is just as stuck. I wonder if 
I'm drunk. That might explain why I can’t 
remember what happened just before I 
ended up ... wherever here is. 

But here, here something appears. I want 
to say ‘swims into vision’, but that’s not 
quite right, either. Nothing is quite right. 
There are two somethings, now. At least I 
think there are two. I can't focus properly. 
Maybe I can focus, and they are meant to 
be that fuzzy? 

They are dark. I want to say they are the 
size of dinner plates, but I can’t tell how far 
away they are. And now they shrink. Are 
they shrinking, or are they — 

Ah. They are moving farther away. And 
they are framed by something that’s shaped 
almost like a — and they’re gone, and now 
they’re back. Is that a nose, do you think? 
A slit in the greenness appears, a dark slit 
that seems to oscillate (and there again 
is the memory of the electronics bench 
swimming just out of my depth) and some 
strange noise in my ears. 

And understanding in my head. 

It is awake. 

The words make themselves under- 
stood, even though I don’t remember 
hearing them. I am surprised that I am 
lucid enough to realize this. But I’ve seen 
this apparition before: my heart sud- 
denly racing, I open my mouth to scream 
but before I can it fills with something 
— something that feels like candy floss, 
squeaky like a balloon; dry like the taste 
of chalk. I try to sit up, buttocks clenching, 
chest straining against the straps holding 
me down, pin pricks of sweat on my brow: 
one drops into my right eye and I can't 
wipe it away. 

There will be a moment of readjust- 
ment. 

I fall back, breathing heavily. I remember 
getting out of a ski lift somewhere in the 
Swiss Alps, fighting to draw oxygen into 
my lungs. But there was the hush of freshly 
fallen snow; here this strange, pervasive, 
persistent borborygmus. 

It is all right. Everything will be okay. 

It’s not a voice, it’s a certainty in your 
mind. Force yourself to look at the crea- 
ture (naming it tames the terror even 
as it engenders it) and focus on it. Take 
in its round eyes; the holes for nostrils; 
the lipless, quivering mouth. Force your- 
self to stay still as three long, thin stalks 
that you suddenly know to be fingers brush 
lightly (oh so lightly, like the touch of 
a hesitant lover — but that is not the source 
of the smell of the solder; this is the scent 
of evening) over your face and remove 
the gag. 

We crashed. There is a problem. 

There is a noise, a real, honest-to-God, 
human noise, like the release of pressure 
from a train's brakes. Light (and can this be 
real sunlight fording this turbid air like a 
frontiersman?) breaks in, and the creature 
seems smaller somehow; no less inhuman 
but not, somehow, as alien. 




“I’m still on Earth?” I manage to force 
out. 

Yes. There is no structural damage. We 
need your help. 

A certainty. But why me? I'm a programmer, 
not an engineer. 

“Show me.” 

The glaucous light changes then, flickers, 
moves. I am on a gurney and they are 
transporting me... where? I am being 
raised: my feet come into view, and sud- 
denly my arms are free. Straps yet restrain 
my legs, but the tightness across my chest 
is gone. In front of me, it looks like nothing 
so much as a TV screen, or computer 
monitor. Blue screen, with white, alien 
characters. But some, I realize as my 
blood suddenly pounds in my ears, some I 
recognize: 

0x0001000B 0x5043... 

From the endless depths of space they 
came: technology to conquer distances that 
can only be measured in terms of photons; 
a civilization I can't imagine. And yet... 

I look at the blue screen again. Over my 
coughing, the pain in my chest, the tears 
that are suddenly streaming down my 
cheek, I hear them say, trembling, almost 
apologetic: 

It just stopped. We didn’t do anything 
different. 
rpg: poet, scientist, gadfly. Currently 
rocking the world of information 
architecture after an enforced sojourn in 
the Antipodes. Nowhere is safe. 

Join the discussion of Futures in Nature at 
go.nature.com/QMAm2a 